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ATOMIC METAPHYSICS * 


According to Dr Seuss, the father of the father of Nadd “knows everything 
in all [the] kingdom, in all the world beyond, and in all the other worlds 
that may happen to be."——-Ths Frve Hundred Hats of Bartholomew Cubans 


s a simple statistical model, consider a pair of coins repeat- 
A edly flipped. From experience, the usual frequency is heads- 
heads (or tails-tails) one time in four, and headstails one 
time in two. A second possibility is that typical in quantum mechan- 
ics; heads-heads, tails-tails, and heads-tails each occur one time in 
three. Often these frequencies are known as Maxwell-Bolizmann and 
Bose-Hinstein, respectively. It is suggested—in particular by standard 
physics texts—that this difference is the important philosophical in- 
novation of quantum mechanics for statistical physics. The aim of 
this paper is to construct a precise framework for judging this claim, 
one which reveals a considerable confusion in standard pronounce- 
ments. More precisely, I shall demonstrate that the occurrence of 
Maxwell Boltzmann statistics in classical mechanics has none of the 
metaphysical implications that it is often thought to have. In so do- 
ing, I show that the terms ‘Maxwell-Boltzmann’ and ‘Bose-Einstein’ 
are themselves ambiguous, so I shall not use them until matters have 
been clarified. 
If we suppose that these statistics are reflections of a uniform mea- 
sure over universes of possible worlds—distinct physical states—then 


* This paper is based on work submitted for a Ph.D. at Rutgers University. Ear- 
Her versions have been presented at the University of Manchester, Cohumbia Unt 
versity, the Univermty of Kentucky, the University of IlHnois/Chicago, the London 
School of Economica, and Leeds University. I wish to thank all concerned for their 
hospitality and helpful comments. Special thanks to Robert Weingard and Tim 
Maudiin. 
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the two kinds have different implications for the universes to which 
they belong. The first, 1:2:1, implies that the modal universe con- 
tains two states that are heads-tails for every one that is heads-heads. 
Of course, this is not a surprising result in our world, for ‘heads-or- 
tails’ describes only a few of any coin’s properties. There is only one 
way for both the quarter I got in my change at “K-Mart,” and the one 
from “The Frog and the Peach” to be heads, but the “K-Mart” coin 
could be either heads or tails if they fall heads-tails. Things become 
interesting when we have statistics for a complete set of properties. 
For instance, consider the kind of qualitatively impoverished world 
in which the only positive, logically simple properties available to a 
coin are heads and tails. If there are now two states in which we find 
the result heads-tails, then there are two qualitatively indistinguish- 
able worlds that cannot be identified. Does the difference not lie 
purely in the way that properties are distributed over individuals? 
Now this coin is heads, now that. 

We appear to have an argument that takes us from an experiment 
involving coins to metaphysical conclusions about the structure of 
the modal universe of possible physical states. It seems that the ob- 
served statistics are to be explained by a mini physical theory that dis- 
tinguishes four possibilities, including two worlds (the heads-tails 
worlds) that are qualitatively indistinguishable and yet distribute 
qualities differently to individuals. Loosely, this implies a denial of 
the Identity of Indiscernibles across possible worlds: a distinct coin is 
tails in each of the two worlds. This denial is the doctrine of haec- 
certism, which I shall clarify below. What motivates this discussion, 
then, is the suggestion that this significant metaphysical issue’ can 
be settled experimentally. 

The job of section 1 is to explicate the notion of haecceitism, and 
to distinguish it from various other aspects of individuality with 
which it is easily conflated. In subsequent sections (u and m) we shall 
generalize from the coins example, and see in a rigorous way what 
haecceitism means in physically realistic systems. Section IV will out- 
line empirical arguments for and against haecceitistic representa- 
tions in classical physics. Sections v and vi will demonstrate why they 
fail; after all, there is no experimental way to determine whether the 
classical world is haecceitistic or not. In the final section, I shall ex- 
plore the consequences of this conclusion for the metaphysics of 
classical mechanics. 


1 Significant, for instance, to the theory of rigid dezignanon: Saul Kripke, Nam- 
mg and Necessity (Cambridge. Harvard, 1980), pp. 15-20, touches this point. 
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; I HAECCEITISM AND INDIVIDUALITY 

To evaluate the truth values of modal propositions, we need to know 
what things in one world, say, ours, are getting up to in another: for 
instance, is there any world in which I present a television game 
show? However this goal is achieved, following David Lewis* we shall 
say of someone in our world, that another world represents de re of 
him that he has whatever properties “he” (perhaps his counterpart, 
or he himself) has in that world. So, worlds in which “I” am on the 
box represent de re of me that I am a television personality. Now we 
can precisely state the doctrine of haecceitism: 


If two worlds differ in what they represent de re concerning some indi 
vidual, but do not differ qualitatively in any way, I shall call that a hasc- 
cestistic difference. Haeccatism is the doctrine that there are at least some 
cases of haecceitistic difference between worlds. An#t-Aaecceitism is the 
doctrine that there are not (iid, p. 221). 


Suppose we find conventional (one time in two) statistics for 
heads-tails, then (still supposing these to be the only properties that 
coins have) we might infer the existence of two worlds (both with 
one coin heads and one tails) that are qualitatively identical. But we 
take it that they do differ in their de re representations, for the 
heads coin in one world is tails in the other, and vice versa. Hence 
there is a haecceitistic difference between the worlds, and their 
moda] universe is haecceitistic. 

By way of further explication of this doctrine, recall what might 
motivate it in the first place. For instance, is it possible that ‘I could 
have been John Kuti’ is true? Not just in the sense that I could have 
gone to law school, that I could have had a friend called Nick 
Huggett, and that I could have moved to Moscow. Rather, could I 
have had the very same life that John has in this world? It seems that 
this kind of statement must be false on any account of my other- 
worldly selves which picks them out qualitatively. The person with 
John’s life will always be picked out as John, not me. But if we have a 
nonqualitative way of picking ourselves out in other worlds, then we 
open up the possibility that we could get up to different things in 
qualitatively identical worlds. So, asserting the truth of such state- 
ments seems to drive one toward haecccitism. 

Extreme caution is needed here, for it is very easy to slip from 
haecceitism into considerations of haecceitws;, differences in de re 
representation in the face of qualitative identity could be explained 


? On the Phurality of Worlds (New York: Blackwell, 1986), § 4.4. 
* For a discussion of the idea of “qualitative character,” see iid., p. 22. 
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by nonqualitative properties of “thisness” coinstantiated with quali 
ties in different ways. In one case, for instance, the haecceity this- 
ness-of-coin-1 might be co-instantiated with the atomic quality of 
being-heads, but in a qualitatively identical world, with being-tails.* 
But (as Lewis points out) haecceitism is neither necessary nor suff 
cient for haecceities. For instance, in two identical worlds of individ- 
uals that maximally violate the Identity of Indiscernibles—so that all 
the individuals in both worlds are in identical states—both worlds 
represent the same of any individual, and so haecceitism fails. But 
there could still be nonqualitative properties differentiating the indi 
viduals. On the other hand, differences in representation de re do 
not have to be explained in terms of haecceities, for they could be 
brute metaphysical facts. 

The point of this caution is to emphasize that we are investigating 
the well-defined notion of haecceitism, not the separate, murky no- 
tion of an haecceity. Our resolve will allow us to acknowledge that 
haecceitism is only one aspect of individuality and that other related 
concepts warrant separate investigation. In particular, we are inter- 
ested in the converse of logical identity, drversity, and Hans Reichen- 
bach’s® genidentity (in one sense of that term): the property of 
selfidentity through time. Now we can see how attempting to discuss 
individuality in terms of haecceities muddies the water. The problem 
is that they can be implicated in all its distinct aspects. One might ar- 
gue that atoms are distinct because they possess distinct haecceities, 
or that instances are genidentical because they possess the same 
haecceity. Diversity and genidentity, however, could also be brute 
features of individuals. If we drop the issue of haecceities, we can 
treat the different aspects of individuality separately. This is as well 
because, as I shall show at the end, they are not equivalent. For now, 


let us concentrate on haecceitism. 
IL REPRESENTATIONS 


Consider Democritean worlds, those worlds consisting of basic ele- 
ments which we can call atoms: our qualitatively impoverished coin 
worlds or the point-particle world of classical mechanics for instance. 
In this section, I shall discuss what possible states such a world might 
have: or more precisely, what states theories of such a world might 
represent there to be. Since “possible states” are nothing but posst 
ble worlds, the structure in which states are represented by a theory 


1 Clearly, for such worlds to be qualitatively identical, haecceities must be ex- 
cluded from the roltcall of qualities, perhaps on the grounds that they do not fea 
ture ın explanations of mature science. 

5 The Derection of Time (Berkeley: California UP, 1956), pp. 270-71. 
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is the structure of the universe of possible worlds postulated by the 
theory. The point of view of this paper is that haecceitism is thus a 
doctrine concerning such structures. 

First we treat the possible properties of a single atom: let there be 
m logically independent properties, each with its own range of possi 
ble values—its spectrum. I shall take it that any spectrum can be rep- 
resented on some portion of some number line. The obvious 
paradigm is spatial location represented by real coordinates. 

The state of an atom is the conjunction of the properties that it 
possesses at an instant. This suggests a way of representing the states; 
we form the m-dimensional single-particle state space, -space (for 
gas ‘molecule’ in traditional statistical mechanics), as the Cartesian 
product of the number-line segments. Different states are repre- 
sented by different points in space, or, when more convenient, by 
their m coordinates in the space. For instance, the discrete spectrum 
heads-or-tails has two p-states, which are represented by two cells 
(figure la). In classical mechanics in three-space, a particle’ has six 
independent dynamic properties, which may be chosen to be its spa- 
tial coordinates and the components of its momentum, and a state is 
represented by a point in six-space, that is, by six coordinates. The 
dimensionality of space is not important for present purposes; all 
we require is that for any atom, it contains a fixed number of distinct 
cells representing distinct states. 


iH 
H T 


h actos oa 
- coins—one 
Coin 1 
(b) T-space for a pair 
of colne—cotn 1 taila, 
coin 2 heads 


Figure 1: State Spaces. More complex systems are represented in fust the same way, 
except h- and Z-spaces are larger, and I -space has more dimensions. 


® Assuming that there are no internal degrees of freedom 
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Atoms may also posses constant properties, such as mass and the 
charge, but we shall be concerned only with homogeneous gases, 
composed of atoms of a single “species,” all with the same constant 
properties and identical u-spaces. We shall thus be able to ignore 
such properties: the atoms in the folowing discussion will differ 
qualitatively just in case they have different -states (in identical u- 
spaces). 

One-atom worlds are pretty dull. What are the states of a system of 
many atoms, a “gas”? The obvious answer is that they are the con- 
junctions of the states of each atom. They can be represented by as- 
signing a space to each atom, so that the global state space for a 
system of n atoms, | ’-space (for ‘gas’), is the Cartesian product of n u- 
spaces. For instance, for two coins I -space is composed of four cells, 
{heads, tails} x {heads,tails} (figure 1b). Or again, the state of three stæ 
tic particles in three-space will be represented by a point in a nine dt 
mensional configuration space. In classical statistical physics, where 
particles are described by position and momentum coordinates, I- 
space is the phase space. Following this convention, we can generically 
call states in I-space phases. 

But there is another possibility for the states of a many-atom gas: 
the state of the system is not the conjunction but the set of all the 
atomic states instantiated by the system. The representation of this 
alternative kind of state simply describes how many atoms are in 
each u-state, and so the state space has the same structure as [-space: 
this space is Z- (Zeta-) or distribution space. For discrete spectra, we 
associate each cell with a value equal to the number of atoms in the 
corresponding [-state. For instance, a state one-coin-heads-and-one- 
tails is represented by associating the value 1 with both the heads cell 
and the tails cell (figure Ic). Hence Z-states are not represented by 
points of Z-space but distributions over it. Similarly for continuous 
spectra: the Z-space of a system of particles is a distribution of parti- 
cle numbers over a single particle posiion-momentum space. In- 
deed, we shall talk of Z -states as distributions. 

Z-space then just lists what atomic properties are instantiated in 
what numbers, whereas I-space also describes which individuals pos- 
sess which properties. For instance, when I -space says “particles 1, 4, 
and 5 are over here, and 2 and 3 over there,” Z-space simply says 
“there are three particles over here, and two over there.” What is im- 
mediately apparent from this comparison is that there is a many-one 
mapping of I -space phases into Z -space distributions. For instance, 
the distribution is the same if now the phase is “1, 2, and 3 are over 
here, and 4 and 5 over there” instead. 
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How “many-one” is this mapping? Consider properties with dis 
crete and finite spectra. We suppose that an atom has c available 
states, so that -space is divided into c distinct multidimensional 
cells. A many-particle distribution can be expressed as the tuple 
<O,,0,,...0.>, where O, is the occupakon number of the cell: the num- 
ber of atoms in state 1 So how many phases have this distribution? 
The result is a standard piece of combinatorics, which is worth recall 
woi 

Let N,-(Q,,...0,) be the number of phases corresponding to a dis- 
tribution <O,,Q,,...0.>. Then, if n is the number of atoms in the gas: 


N,(0,,...0,) = ways of picking O, atoms X ways of picking Q, atoms 


out of to be in state 1 out of (n-O,) to be m state 2 

x ways of picking Q atoms ...X ways of picking O, atoms 
out of (n-O,-Q,) to be in out of O, to be in state c 
state 3 


But these terms are simply binomial coefficients, so: 
N-(Oy,-O,) = nt/0,1(n-O,)1 X (n-O,)1/Ol(n-O-0,)1 ~ X 0,1/0,1(0,-0,)1 
= nl/(O,10,1...0,1) = n/[ [0] eqn 1 


m=] 

The validity of this formula for a pair of coins is easily checked; for 
instance, for Op = Og= 1, we get Np= 2I/ (1111) = 2 headsails 
states. The corresponding quantity for Z-space is trivially unity, for Z- 
states just are distributions. If N,(O,,...0,) is the number of Z-states 
corresponding to the distribution <Q,,...0>, then: 


N,(O,,..0,) = 1 eqn 2 


Clearly, important modal considerations are making themselves 
felt at this point, for the values of these Ns are the numbers of possi 
ble global states compatible with a single qualitative description. 
Thus, we have nearly reached a central question: Does the qualita- 
tive character of an atomic world uniquely determine it, or is it yet 
incomplete in some way? Specifically, is Z-space adequate for de- 
scribing all phenomena, or is the extra structure of I -space re- 
quired? In the next section, I shall investigate why this is a question 
of haecceitism. 


? See, for example, D. ter Haar, Elements of Statistical Mechanics (New York: Holt, 
Rinehart & Winston, 1964), p. 27. The key notion w that of a benomial 
(B.A) = the number of distinct subsets of A elements contained in a set of B ele- 
ments = B/{Al(B-A)!)}. Also recall that while in general Al=1-2-3-...(4-2):(41)-A, 
we have the special identity that Ol=1. 
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First consider how empirical consequences might follow from 
these abstract considerations. The simplest way is by equipping these 
spaces with a probability function, a measure, and the simplest 
choice is a uniform distribution: indifference over the states. Then 
the fact that N-#N, will in general lead to differences in frequencies 
of states, something that one might hope to observe. So we might be 
able to tell which of I-space and Z-space actually capture possibility 
simply by seeing which of these statistics actually obtains. I shall re- 
turn to this point in section IV. 

Il. THE METAPHYSICS OF PHASE SPACE 
Drawing metaphysical implications from observed phenomena is a 
game with only one obvious set of rules: first, we determine the dif 
ferent modal properties of the universes of possible worlds described 
by different representations, and then we investigate whether the 
data favor one. In this section, I tackle the modal issue; in the next, I 
shall look at the data. 

First, we need to confirm that a Z-state gives a complete account 
of the qualitative character of a world: any two states with the same 
distributions are qualitatively identical.® I shall assume that single 
particle states are qualitatively complete by construction. That is, just 
as before we considered coins whose only properties are heads and 
tails (not mint year, denomination, and so on), now we take it that 
any possible qualitative differences at all between atoms are reflected 
in their U-spaces. There are no further qualitative differences: the 
atoms are not, say, also different colors. 

Now, worlds could have qualitative characters not determined by 
distributions of physical atoms: for instance, the dualist one in which 
a thirsty Descartes yearns for the cold beer that he believes is in 
the refrigerator. But even in this sort of example, there are no physt 
cal properties that supervene on [-space states, but not on distribu- 
tions, at least in worlds similar to ours. If this modest claim is true, 
then we can hold any nonsupervenient (and hence nonphysical) 
qualities fixed and then ignore them as we consider how worlds vary 
through universes of possibility; we shall not overlook any qualitative 
differences “arising from” the atoms. Hence, any two states corre- 


® A couple of cautions should be made here, for by looking at the structure of 
state space, I am not immediately discussing full diachronic possible worlds, but 
rather instantaneous “miniworlds.” This is not an issue at this point; for example, if 
Z- states give a complete description of the instantaneous qualitative character of a 
world, then a senes of Z-states gives a complete qualitative description of a dr 
achronic world. One might try to qualitatively distinguish identical Z-states in di 
achronic worlds by the histories that produced them, but I shall block this move by 
focusing on determunistic systems, with unique trajectories through any Z-state. 
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sponding to the same distribution have the same qualitative charac- 
ter, insofar as it is determined by atomic structure at all. 

This seems correct. We know of no properties that depend not just 
on there being atoms of certain masses and charges at x, y, and zat 
certain times, but also that it is a particular atom, delineated non 
qualitatively, that possesses the charge and mass and occupies point 
x. It is hard to see how one could offer a general a priori argument 
against such a property, but it is equally hard to imagine what would 
prevent us from viewing the dependence as being, not on a particu 
lar atom, but on some new unique physical property of that atom. 

Haecceitism then does not hold in the Z-universe, if we assume 
that the Identity of Indiscernibles applies between states. By the 
qualitative completeness of Z-space, any two qualitatively identical 
states have the same distribution, and thus by assumption are strictly 
identical. But a single world cannot differ from itself in de re repre- 
sentations concerning individuals, and hence haecceitistic differ- 
ences are not possible. 

On the other hand, |-space is far more promising, for it contains 
distinct yet qualitatively identical states; distinct phases may corre- 
spond to a single distribution. Further, we have naturally been um 
derstanding such distinctions as arising purely because, for instance, 
one phase represents that atom | is in state a, and the other that it is 
in state b. This is a difference in de re representation, so such distinc- 
tions are haecceitistic differences in Lewis’s sense. We shall thus take 
it that the doctrine of haecceitism holds in [-space, and that empiri 
cal evidence for I-space would be empirical evidence for haec- 
ceitism. Conversely, evidence for Z-space would be evidence against 
haecceitism. 

Now that we have all the metaphysics in place, we need to work on 
the other half of the basic metaphysical argument: that the data 


force a choice between Z- and I -spaces. 
IV. THE EMPIRICAL BASIS 


I now want to add some physics to the bare representations. | shall 
assume a Newtonian dynamics in which the complete qualitative de- 
scription can be given in terms of the continuously valued positions 
and momenta. The evolution depends on these properties only, not 
on labels. Thus Newtonian theory can be represented in Z-space or 
in I-space. The point is that the form of classical atomic dynamics 
does not immediately select one of the state spaces. We need some 
more physics: time-averaged statistics. 

The consequences of statistical physics are of two kinds, statistical 
and entropic, an important distinction here. In classical mechanics, 
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the former evidence speaks for haecceitism and the latter against it. 
Now, the exact frequency with which a given distribution occurs is 
not directly observable, but must be inferred from statistical proper- 
ties. As an idealization, however, I shall talk as if we have direct ac- 
cess to the frequencies. Since I show that this perfect knowledge 
does not select one of the representations—in realistic situations 
they concur—I can conclude that neither can the less ideal knowl 
edge obtained in practice. Entropy is different; if there were facts 
about the value of entropy of a system, then knowing the values 
could be of help. The thrust of the argument will be that there are 
no such facts. 

So, how do the statistical arguments work? The idea is that since 
there are many phases for any distribution, the two spaces predict 
different long-run frequencies. Once these are calculated, all one 
need do is observe which actually occurs. For the computation, first 
assume indifference in both spaces, so that the probability of a distri- 
bution will simply be the number of ways of realizing it divided by 
the total number of distinct states. Let there be Z distinct distribu- 
tions altogether’ and a total of I distinct phases in the whole of 
phase space. Then the frequencies of a given distribution, 
<O,,Oy,...0>, according to Z- and [-spaces respectively are, using 
equation two: 


F O,,..0,) = N,(O,,...0,)/all distributions = 1/Z eqn 8a 


F(O,,...0,) = N,-(O,,...0,)/all phases = N{O,,...0,)/I eqn 8b 


If these disagree for physical systems, then they effectively give us a 
way of empirically counting the number of physically possible 
worlds, and settling the issue of haecceitism. 

In this specific context, let me first say something about the as- 
sumption of indifference over the possible states. One might point 
out that without the assumption of indifference, Fy and F, could be 
chosen to be anything, including identical functions. In this case, the 
argument cannot get off the ground, and there is no more evidence 
for haecceitism than for antuhaecceitism. But to adopt this position is 
to step into a messy debate about prior probabilities. In the next sec- 
tion, I shall claim that the argument for haecceitism can be defeated 
much more solidly, so I shall not entertain this possibility. 

I shall take it that the data of classical statistical mechanics as in- 
ferred from thermodynamics reveal that the correct frequencies are 


* Forn es each with ¢ possible states, it is not hard to show that Z = (#+¢- 
1}!/nl(c-1)!—see, for example, Reichenbach, p. 230. 
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those given by Fn equivalently, the statistics are Maxwell-Boltzmann. 
The statistical argument infers back from the observed statistics, argu- 
ing that they provide confirmation of the correctness of I-space, and 
hence of the veracity of the doctrine of haecceitism. 

I claim that this is the standard position, adopted, for instance, by 
texts on statistical physics (for example, ter Haar, pp. 71-72). After 
all, the Gibbs paradox,’® which has exercised physicists so much, re- 
lies, in one of its forms, on the individuality of atoms. This position is 
also present in the views of certain philosophers: for instance, 
Michael Redhead and Paul Teller’ make it part of a traditional con- 
ception of classical particles (ibd. pp. 202-03), which they argue is 
inapplicable in quantum mechanics. Reichenbach suggests the same 
idea: 

..in classical physics, two arrangements are regarded as different 
when they result from one another by an interchange of molecules be- 
tween two given compartments. 


But, in quantum statistics: 


It was shown by Bose and Einstein that...we must give up regarding 
two arrangements as different when the particles in different compart- 
ments are interchanged (op. at, pp. 228-30). 


I shall say more about entropy later, but the important point is 
that it is a function of the number of distinct states, N (= N,;or Nz), 
corresponding to a distribution: the entropy, $ = S(N). Since N 
takes on different values in Z- and I -spaces, so will the entropy. Let 
us take it that there is prima facie evidence (to be discussed) from 
thermodynamics that N, gives the correct value for the entropy: that 
is, S = S(N,). The entropic argument infers back from this evidence, 
arguing that it provides confirmation of the correctness of Z-space, 
and hence of the falsity of the doctrine of haecceitism. 

Now that I have clearly stated the two contrary arguments, statisti- 
cal and entropic, I can proceed to see which, if either, prevails. Does 
the empirical evidence decide the metaphysical issue? 

V THE STATISTICAL ARGUMENT 
The statistical argument is not deductively valid; all that is shown is 
that [-space (and indifference) yields the observed statistics. It does 
not immediately follow from the confirmation of I-space that Z- 


For example, N. G. van Kampen, “The Gibbs Paradox,” in W. E. Parry, ed., 
Theoretical Physics (Oxford: Pergamon, 1984), pp. 303-12. 

H “Parucle Labels and The Theory of Indistinguishable Particles ın Quantum 
Mechanics,” Britisk Journal for the Philosophy of Saence, xum (1992): 201-18. 
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space is disconfirmed, unless one also shows that they disagree in the 
relevant experimental context. For the coins, they obviously do dis- 
agree (we get heads-tails either one time in two, or one time in 
three); but it is easy to see that in physically realistic cases the pre- 
dicted frequencies agree. Hence the observed statistics confirm Z- 
and I’-spaces, antihaecceitism and haecceitism, equally. 

The coins are irrelevant to settling the issue because, of course, one 
needs to consider atoms whose -spaces are qualitatively complete, un- 
like heads-tails for real coins. The kind of real system whose atoms 
have qualitatively complete ji-states (bar shared constant properties) is 
a homogeneous classical gas of rigid Newtonian particles. The key fea- 
ture of such a system which ensures the agreement of phase and distri 
bution is—as we shall see below—that no two Newtonian particles ever 


simultaneously occupy the same point of space: O, = 1 or 0 always. 
Result I: Given O,S1, distribution and phase spaces entail the same fre- 
quencies. 
Proof 
(i) Assume O,S1 for every single particle state, so O,!=1 for every 1 
Then, using equation one: 


NAO,,...0,) = n/[ [01 = al/(1° 1.1) = al 
im] 
Given the assumption, there are exactly n! phases to any distribution, 
and hence if there are Z distributions, the total number of phases, T, 
is niZ. 


(ii) Hence, by (i) and equations three: 


FAO,,...0,) = n/n Z= 1/Z= F,(O,,...0,) 


Given the assumption, the statistics implied by both Z- and I’-spaces 
for distributions are identical,” In this situation, statistics speak equally 
for both spaces, and do not force any choice. This is significant, for 
there are two good reasons to suppose the assumption is correct. 

First, it holds if atoms are impenetrable, for then they just cannot 
occupy the same location. But less problematically, it is also true if 
one of the atomic properties is characterized by a number line with 


1t Note that one need not assume indifference to derive these results, merely 
that the prior probability, p(O,,...0.), is the same function of distributions in both 
Note, too, that exactly the same results apply for functions of distributions: 


ics, tnchiding the Problem of Renormalization (Ph.D. Thesis, Rutgers University, 1995). 
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an infinite number of points, as long as there are a finite number of 
atoms (and given the assumption of a—Lebesgue—auniform proba- 
bility). The classical world is spatial, so all position and momentum 
coordinates are of this kind. We can see the relevance of this very 
simply: consider for illustration a system of two particles in one-space 
dimension. The measure of the set of cells of double occupancy is 
the measure of the “diagonal,” x,=x,  p=f, in the four-dimen- 
sional phase space. In the limit that the cells become continuous 
points, this line has measure zero in phase space, and similarly for 
higher dimensions. The point then is not that the spaces necessarily 
are so constrained, but that they might as well be for the statistical 
properties; which is exactly what we need to see that empirical re- 
sults will not distinguish the spaces for us. Of course, other sets of 
points have measure zero, too, but the point here is that this set is 
the only one in which Z- and I -space counting disagrees by anything 
other an irrelevant (n!) counting constant.* 

Those familiar with statistical physics may find this conclusion un- 
settling. When one deals with thermostatistics one works in phase 
space, and seems to make no assumption that occupation numbers 
are either one or zero. Proceeding in this way, it seems obvious that 
Z-space is just wrong (as it is for the coins). But now we have seen 
that, as a mathematical consequence of a true observation, our opin- 
ion here is incorrect and Z-space is equally appropriate. There is a 
tension produced by the gap between proving that the approaches 
agree and explicitly seeing how. Elsewhere,‘ I derive the standard 
formulae of classical statistical mechanics from Z-space, showing that 
there is indeed no disagreement. Here a simple result will make the 
same point. 

First note that, in general, thermostatistics is based, not on count- 
ing states characterized over a space of individual cells, but over a 
coarse-grained space in which cells are grouped into blocks. So sup- 
pose that our [l-space contains b blocks each containing d cells, for a 
total of 6: d = c cells. Let <n,...n.> ™ <n> be the distribution over 
the blocks, as the O.s are over the cells. Clearly, even if O,<1, it does 
not follow that nl: our constraint on cell occupation numbers is 
compatible with the absence of constraints in the familiar coarse- 
grained approach to thermostatistics. 


B In quantum mechanics, spectra get quantized, so that they are no longer con- 
tinuous, and the assumptions of the proof fail: states with O,>1 are no longer im- 
probable. In this case, the possibility of settling this metaphysical issue 
experimentally reappears. For a full investigation of this suggestion, see my “Identty 

tum Mechanics and Common Sense,” The Monist, LXXX, 1 (1997): 118-30. 

M The Philosophy of Prelds and Particles m Classical Mechanics, pp 74££. 


18 THE JOURNAL OF PHILOSOPHY 


Block j, containing nj 
atoms in its d cells. 


Figure 2. A region of Z-space coarse-grained into equal blocks. 





Strictly speaking, the empirical evidence of thermodynamics to 
which we have referred is that N-(n,) (= nl/I[n from equation one) 
gives the correct—‘Maxwell-Boltzmann’—number of states corre- 
sponding to a given coarse-grained distnbution over the blocks. I shall 
now calculate Nz(n), the number of fine-grained distributions corre- 
sponding to a coarse-grained one, and show that it agrees with the em- 
pirical N-(n,), up to an irrelevant constant that vanishes in the 
observable frequencies. That is, when we take the standard blocked 
approach to statistical physics, the same result is obtained whether we 
suppose the underlying fine-grained space to be phase or distribution 
space. (Note that Nn) = 1 because many distribution over cells will 
give the same coarse-grained distribution.) 

In this derivation, we again assume O,S1, which we take to follow 
from a sufficient diffuseness that d> n= n, so that each block has 
many more cells than the total number of atoms in the gas. Again, 
this is justified on physical grounds by an appeal to the vast number 
of positions and momenta available to a particle in any finite region. 
Then: 


Result II: Given 0O,<1 and d*n, F,(n) = Fn) 


Proof: 
(i) By 0,< 1, each fine-grained cell will contain at most one atom, so” 


18 This proof can be given without an explicit assumption that O,<1, using only 
diffuseness, d*n. Then, to the initial expression for N,(n) add terms, for example, 
for picking #,-1 cells out of d (that is, for double occupancy), for 2-2 cells out of d, 
and so on. But the binomial coefficients for such terms are at least of order d /{(m- 
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N,(n,) = ways of picking n, X ways of picking n, ... X ways of picking n, 
cells out of d cells out of d cells out of d 
b 
= | [a/{nJ - (d-n)1}, using bmomial coefficients (see footnote 7). 
1™=] 


But for p> q pl/(p-q)! ~ pf, so using d>n (diffuseness) we find, by equa 
ton one: 


b b b 
N(n) ~ [[(d"/nl) =d+ 8/[ [nt = a] [nl = (d*/nl)-NXn) 


m Í pm] =] 


(ii) We can write the total number of (fine grained) distributions as >! 
N,(m) to show that it is the sum of N, (n) over all coarse-grained dis- 
tributions. Similarly, >» N-(n,) is the total number of fine-grained 
phases. Hence, using eden three and (1): 


F,(n)= Nz(n)/ > Nn) 
<> 


~ {d*/nl-N(n)}/{d*/nl- >) NAn)}= NAn)/ >) NA) 
= Fn) ue = 


The point of the last step is that constant differences in counting 
wash out in frequencies, which are all that can be observed, and an 
(albeit large) constant is all that separates N; from Np Here it is 
clear that the 1/IIn! factor characteristic of I-space counting and 
Maxwell-Boltzmann statistics also appears in Z-space statistics, not be- 
cause we count over permutations of -states among atoms (as in the 
derivation of equation one), but because it is introduced by the br 
nomial coefficients for arrangements within blocks. Thus there is no 
mystery at all why, in this case, the two spaces are in agreement. Note 
finally, that although the expression is approximate, in the case of a 
classical gas, we can take the system to be arbitrarily diffuse, and so 
there will be arbitrarily high experimental agreement. 

The lesson should be crystal clear. The choice of space 1s not em- 
pirical, because once O,S1 is imposed, the remaining I -states can be 
arranged into nl sets which map 1-1] into Z-states: nl is just an irrele- 
vant normalizaton constant which washes out in the physically rele- 
vant frequencies. The philosophical pay-off is that statistics do not 
select one space over the other, and hence will not yield conclusions 
I)i(amt1)I} = {m/(dm+1)}: d/{mi(dm)!}, which by diffuseness ws highly sup- 


pressed by a factor ~ n/d Such terms can be discarded and the expression for N; 
is an approximaton, not an identty. 
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about modal metaphysics. Rather surprisingly, the Maxwell-Boltz- 
mann distribution does not entail a haecceitistic universe. The statis- 


tical argument fails. What about the entropic argument? 
VL THE ENTROPIC ARGUMENT 


The entropy in statistical mechanics is defined by the logarithm of 
the number of available (coarse-grained) states: 


Sm k-inN(n,) eqn 5 


where kis Boltzmann’s constant. Now, statistical physics has empir} 
cal import because it is intended to explain thermodynamics. In this 
reduction, the statistical entropy, S, is identified (at least in equilib- 
rium) with the thermodynamical entropy o. The main evidence in 
favor of $, (obtained when, in equation five, N= N,(n,)) is that o is 
supposed to be extensive or additive. That is, if two equilibrium sys- 
tems are in mutual equilibrium, then the entropy of their union is 
the sum of the entropy of the two parts: Cing = 0, + Cy Via the iden- 
tification we expect for the statistically defined equilibrium entropies 
that $,,,= 5S, + $. One can straightforwardly check that this equality 
holds for $z, but not for Sp, obtained by substituting N n) for Nin 
equation five. The nonadditivity of Sp thus is supposed to rule out I- 
space in favor of the extensive 5, Further, observation of entropy 
dependent properties shows that they are in agreement with Sp 
Hence we are supposed to accept Z-space and antihaecceitism. I op- 
pose this conclusion in two ways: on the one hand, there is nothing 
compelling about the inference from S, to Z-space; on the other, the 
data of thermodynamics do not speak unequivocally for $z. 

Let us assume that S, is the correct expression of the statistical en- 
tropy; in the sequel, I shall show this assumption to be unwarranted. 
Entropy is a function of the distribution, but any I-state uniquely de- 
fines a distribution so the entropy can equally be considered a func- 
tion with either [-space or Z-space states as its domain. Assuming 
that $; is the correct expression of entropy, then the two functions 
are easy to find from result I’; in Z-space S,mk - InN,, and in T- 
space S,;mk- in(cN,/ni). In this seemingly trivial sense either space 
supports any definition of entropy. The question is: Is there some 
significant sense in which I-space does not support S,? 

The only apparent response is to give a physical meaning to the 
entropy that requires the assumption of Z-space. For instance, the 
statistical entropy can be implicated in an explanation of the “in- 

@ Note carefully that I am amending the definition of S. To make the same cor- 
rection Gibbs inserted an nl factor in Np and concluded that he had an argument 


for Z-space. My point is that one is free instead to redefine the relationship be- 
tween Sand N, so that no such conclusion 1s warranted. 
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evitability” of equilibrium. By equation five, S, increases with N,(n,). 
The entropy grows as the system moves into distributions corre- 
sponding to more numerous sets of states. Since (given indifference) 
these states are more likely, it follows that entropy will “most likely” 
increase to its maximum.’” Thus S, has a physical meaning as a mes- 
sure of the likelihood of a distribution, but this meaning arises inter- 
changeably from T- or Z- space. It follows from our results that the 
likelihood of any distribution is the same in I- or Z-spaces, and 
hence that they can assign the same entropy to any distribution and 
preserve the physical meaning. 

The point here is that defining $, = k-- In(cN;/ n!) is no mere sleight- 
of-hand, for the physical meaning of entropy does not speak in favor 
of Z-space. There is nothing in this story of statistical mechanics to 
force a commitment to one space or another, or to any stance on 
haecceitism. But not only is there no valid inference from S, to anti- 
haecceitism, there is in fact no reason for selecting Sz over Sp 

First consider an argument by N. G. van Kampen (op. cit.) to the 
effect that additivity is not necessary for thermodynamics. Assume 
that the laws of thermodynamics entail all the relevant observable 
phenomena. The question then is: Is additivity entailed by the laws? 
If not, then it is not a consequence of conventional empirical ther- 
modynamics. J restrict our attention further by observing that the 
thermodynamical entropy is introduced through the Second Law, do 
= dq/T (where dq is the increase in heat and Tis the temperature). 
The important point about the law for the argument is that it only 
defines entropy differences as work is done, not an absolute value. 

First, does the Second Law entail the additivity of the entropies of 
distinct disjoint systems? Since it only describes a single system at dif- 
ferent points mn an evolution, the Second Law says nothing at all about 
extensivity in this case. What about additivity in the situation where 
two samples are physically joined? Surely, the evolution from two phys- 
ically disjoint samples into one can be considered as the development 
of one system. But this does not circumvent the problem, for, accord- 
ing to van Kampen (op. at, p. 305), deriving additivity in this way re- 
lies on the assumption that the entropy of identically prepared distinct 
systems is equal. Again, although this is a reasonable and helpful defi 
nition, it cannot follow from a law concerned solely with differences 
along thermodynamical trajectories. The answer seems to be that addr 
tivity is not an observable consequence of thermodynamics. 


17 The sense of ‘moet likely’ is subtle: see, for example, Lawrence Sklar, Philoso- 
phy of Physics (Boulder: Westview, 1992), pp. 113-17. 
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I conclude that there are no observable entropic properties senst 
tive to the difference between I- and Z-space, and hence no observa 
tion will give us any decisive insight into haecceitism. 

van Kampen (op. at, p. 307) explains why we should not be sur- 
prised about this negative conclusion. The idea of statistical mechan- 
ics is that the data be completely explained in terms of the statistics 
of microstates—after all, if the world is made up of atoms, it is what 
they do that counts. If this principle is correct, then predictions de- 
rived using the entropy cannot go beyond the statistical conse- 
quences of the theory. But, as we have clearly seen, all the statistical 
consequences are compatible with both haecceitism and antihaec- 


ceitism; the observable entropic consequences must be too. 
VIL CLASSICAL CONCLUSIONS 


So far I have established the negative result that the data of thermo- 
dynamics do not decide the metaphysical issue of haecceitism; now I 
should consider the possibility that we are faced here with an issue 
that is “metaphysical” in the traditional sense of transcending mere 
scientific considerations. The point is that the difference between I- 
and Z-spaces is genuinely physically redundant. Certainly, they are 
empirically underdetermined, but it is not simply as if one of the the- 
ories had extra unobservable processes taking place, rather phase- 
space simply asserts a difference where distribution-space sees none. 
That is, I -space asserts exactly everything that Z-space does, plus it 
claims that every kind of haecceitistic difference is possible, and 
nothing more. Given this, one is faced with the question of why such 
surplus representational power should be postulated. At the very 
least it provides a significant sense in which the choice between I- 
and Z-spaces is not between physical theories, but between meta- 
physical outlooks. And arguably, since the extra claim of haecceitism 
does no work, Okhamite considerations should drive us to reject it. 

I am tempted to go further here and claim that this is no real dif- 
ference at all: that this so-called metaphysical difference is nothing 
but empty words. It simply asserts a difference without giving an ac- 
count of what it is supposed to be. To say that “this atom” now has 
this property, and now that, and for this to be saying something sub- 
stantial presupposes that something meaningful is being done in the 
denotation in the different situations, and the burden of proof falls 
to the haecceitist to show that this 1s so. 

Finally, let me put the example of classical statistical mechanics to 
work to discover the logical relations between the various aspects of 
individuality. In the first place, the assertion of genidentity and dt 
versity is compatible with the denial of haecceitism. An “excluding” 
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property like locaton in -space clearly behaves in some ways akin to 
a haecceity, grounding a kind of qualitative genidentity and distinct- 
ness (though not haecceitism.) And there is no prima facie reason 
why one should not suppose that these qualitative features of individ- 
uals co-occur with metaphysical ones. Atoms distinguished by loca- 
tions can be supposed metaphysically distinct individuals, and 
instances of particles joined by continuous space-time trajectories in- 
stances of a strictly identical individual. (Of course, these features 
are also compatible with the rejection of any extra metaphysical 
facts, so we are not forced into any metaphysics here either.) This 
reasoning holds in both Z- and I-space, and so antihaecceitism does 
not rule out genidentity or diversity. Neither diversity nor geniden- 
tity are sufficient for haecceitism."® 

But metaphysical genidentity and diversity do not necessarily hold 
in virtue of qualitative features; as an alternative, distinct trajectories 
can be thought of merely as the “marks” of these metaphysical fea- 
tures. But in metaphysics, surely something can occur without its 
qualitative marks? In this case, in the face of the denial of haec- 
ceitism, these components of individuality do not depend on the ex- 
istence of suitable properties. But this is to say that individuals can 
be distinct and genidentical regardless of what properties they have 
in an antihaecceitistic universe. 

What about the reverse implications? Haecceitism must imply dt 
versity: How can we even talk about what is represented de re about 
Peter unless he is distinct from Paul? Genidentity is another matter, 
for surely worlds in a haecceitistic universe could be composed of in- 
dividuals constantly created and destroyed every instant. Similarly, 
genidentity implies diversity, for how can an atom be reidentified 
unless it can be picked out of an ensemble? 

An important motivation for carrying out this lengthy investiga- 
tion is to help us pinpoint the ways in which quantum mechanics dif- 
fers from classical mechanics. One could easily be led to think that 
an important difference follows from the different ways in which 
“identical” particles are handled in the two approaches. Different 
statistics point to different metaphysics of individuality in each case: 
haecceitism in classical mechanics and anthaecceitism in quantum 
mechanics. My analysis has been directed at showing that there are 
no very heavy metaphysical implications of classical physics, and that 


18 Actually the issue us not quite this straightforward if paths of genidenuty can 
become detached from a continuous trajectory, so that continuents ‘hop around’ 
the continuous paths mapped out by matter. Then any instantaneous state is haec- 
celtistic, because ‘this atom’ could have started on any of the trajectories. 
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therefore anticipated innovations in the notion of an individual in 
quantum mechanics will not be innovative at all. This is not to say 
there are no differences at all, but it is to say that the novelty of 
quantum mechanics is artificially heightened if we add to classical 
physics the unwarranted baggage of “a metaphysical position which 
has been moribund for centuries.”'® 

NICK HUGGETT 
University of Dllinois/Chicago 


© Bas van Fraamen, Quantum Mechanics: An Empiricist View (New York: Oxford, 
1991), p. 434. For an investigation of the metaphysics of identity entailed by quam 
tum mechanics, see my “Identty: Quantum Mechanica and Common Sense.” 
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SEEING INTERSECTING ECLIPSES* 


am viewing a double eclipse of the sun. Traveling east is the heat 

enly body Far. Traveling west and nearer to me is the smaller 

body Near. Near is close enough to compensate exactly for its 
smaller size with respect to shadow formation. Near and Far look the 
same size from my vantage point. When Near falls exactly under the 
shadow of Far, it is as if one of these heavenly bodies had disap- 
peared. Do I see Near or Far? 


Os 


Figure | 


* I thank farsighted members of colloquium audiences at Dartmouth College, 
the Univernty of Connecticut, and the University of Delaware for their perceptive 
comments. I also received insightful counsel from Jonathan Adler, John Carroll, 
Fred Dretske, Julia Driver, Doug Ehring, Brian McLaughlin, David Robb, Walter 
Sinnott-Armstrong, Gerald Vision, and Denis Walsh. 
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Common sense answers that I see Near rather than Far: Near is an 
opaque body that completely blocks my view of Far. Since I see some- 


thing, I see Near. 


I THE CAUSAL THEORY CORRECTLY PRECLUDES NEAR 

According to the causal theory of perception, S sees object O just 
when there is an appropriate causal connection between S and O. 
‘Appropriate’ has proven difficult to spell out. But the theory entails 
a necessary condition that clearly precludes the possibility that I see 
Near. S sees O only if O is a cause of what I see. Near is totally within 
the shadow cast by Far. An object that is completely enveloped in a 
perfectly dark shadow cannot be seen. A cat silhouetted against the 
moon can be seen by virtue of the light it blocks. But since Far is pre- 
venting any light from striking Near, Near is not casting a shadow. 
Near is causally idle and therefore invisible. If light were being re- 
flected by Near into my eyes, Near would preempt Far; and I would 
see Near. But since I am actually seeing by virtue of partial light 
blockage, it is Far that preempts Near. 

H. P. Grice’ motivated the causal theory of perception with a 
thought experiment. Suppose I believe I am facing an unoccluded 
pillar. Unbeknownst to me, there is a mirror that is reflecting the im- 
age of the pillar to my rear. On the opposite side of the mirror there 
is a pillar, indeed one that would look just like the one in the mirror. 
If the mirror were removed, there would be no phenomenological 
difference. Under these conditions, common sense correctly says I 
see the mirror and not the occluded pillar. The reason is that the oc- 
cluded pillar plays no causal role in what I see. 

Or suppose that there are two identical coins in my line of sight, 
one behind the other. If the front coin is rolled away, the scene will 
look the same. Nevertheless, I do not see the rear coin because it 
makes no actual difference to what I see. The reason why I segt 
the objects facing me is that only they are in a position to transmit 
light into my eyes. More generally, the perceiver’s field of vision is 
widely agreed to comprise the froni surfaces of those opaque bodies 
lying within a conical region radiating outward from the perceiver’s 
open eyes. The causal theory of perception provides a natural expla- 
nation of why my field of vision is structured as it is. 

Alvin Goldman? uses satisfaction of this environmental relation be- 
tween perceiver and perceived as one of three defeasible factors in 
determining whether an object is seen. His second factor is the exis- 


**The Causal Theory of Perception,” Proceedings of the Anstotshan Society, Suppl. 
VoL xxxy (1961): 121-68, here p 141. 
* “Perceptual Objects,” Synthese, XXXV (1977): 257-84. 
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tence of a physical mechanism. By this Goldman means a causal 
pathway by which the object transfers energy to the perceiver. His 
| last factor is counterfactual dependency: Would the scene have 
looked different had the object not been there? Each of these factors 
has some say but none has a veto. Consequently, there will be hard 
cases. Goldman considers Jaegwon Kim’s example of an astronomer 
looking at a black hole through a telescope. The astronomer sees 
the black hole by virtue of its contrast with the background star light. 
According to Goldman, the black hole is in the perceiver’s field of vi 
sion and satisfies the counterfactual dependency condition but vio- 
lates the physical mechanism condition. 

To save the causal theory, Goldman claims that ‘cause’ is ambigu- 
ous between a counterfactual-dependency sense and a physical- 
mechanism sense. Since parallelism and occasionalism deliver 
counterfactual dependency without causation, this ambiguity de- 
fense fails. For good or ill, the causal theory unequivocally con- 
demns idle objects to invisibility. If God made an astronomer have 
hallucinations of a black hole that reliably track the black hole, the 
astronomer would not see the black hole. 

Since black holes violate just one of Goldman’s conditions, he 
(correctly) acquiesces to the intuition that the astronomer sees the 
black hole. It is striking that Far violates all three of Goldman’s con- 
ditions. The environmental relation is unsatisfied because Near is in 
front of Far. The counterfactual dependency condition is violated 
because I would notice no change in the scene if Far were not pres- 
ent. And the physical mechanism condition is violated because Gold- 
man requires that the physical mechanism involve an energy transfer 
from the object to its perceiver. 

Although Near satisfies the environmental condition, Near violates 
Goldman’s other two conditions. Near transfers no energy to me. And 
if Near were absent, the scene would be indistinguishable. Hence, 
Goldman is committed to saying that I see neither Near nor Far. 

I have future comments on each of Goldman’s factors of percep- 
tion. But here is an immediate remark about the environmental rela- 
tion. The relativity of ‘occlude’ forces a corresponding relativization 
of our field of vision. Consider two dark nebulae that exactly overlap 
in the manner of Near and Far. A dark nebula is a dense cloud of in 
terstellar dust. Dark nebulae do not transmit light in the optical re- 
gion of the spectrum. Yet they are conspicuous in any photograph of 
the Milky Way because of their contrast with ambient starlight. The 
twist is that “dark” nebulae glow brightly in the infrared because in- 
terstellar dust efficiently absorbs optical and ultraviolet radiation. An 
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astronomer peering through an infrared telescope can see the near 
nebula. But an astronomer using an optical telescope cannot. Rela- 
tive to infrared observation, the near nebula occludes the far nebula. 
But the reverse holds for optical observation. The infrared as- 
tronomer has a classical field of vision in which he sees the fronts of 
objects. The optical astronomer has a backlit field of vision in which 
one of the objects is visible by its backside. 

The correct solution to the eclipse riddle is that I see Far. This an- 
swer follows as a bold consequence of the causal theory of percep- 
tion, given natural background assumptions. ‘I see Far’ will seem 
bizarre as long as we persist in modeling all vision on seeing objects 
that transmit light (by reflection, refraction, or emission). We must 
be careful not to prejudice the explanation of our field of vision by 
overlooking alternate lighting conditions. In particular, when objects 
are backlit and are seen by virtue of their silhouettes, the principles 
of occlusion are reversed. In backlit conditions, I can hide a small 
suitcase by placing a large suitcase behind it. The reversal runs 
deeper: we see the backs of back-lit objects. Although this thesis 
seems like parapsychology, my premises are drawn from the reigning 


orthodoxy. I am a conservative rather than a radical. 
I. SHADOW ARCHITECTURE 


Any opaque object can be seen by virtue of its silhouette. The shade 
formed by a back-t object is a volume that commences from the far 
side of the object. Call this initiating surface the silhouette.’ The shade 
extends from the silhouette in accordance with principles of projec- 
tive geometry. The shadow can be truncated only by light Intervening 
opaque objects do not stop the shadow. After all, the shadow is a priva- 
tion of light rather than a positive force. Hence, it does not need to 
penetrate objects to reach the opposite side. Nevertheless, the shadow 
is sensitive to whether the object contains light Consider a silhouetted 
safe that contains a flare. If the flare is burning, then the near wall of 
the box blocks light and so casts the shadow at the base of the safe. 

The dark area on the initial surface of an intermediate object is 
called a “cast shadow” by architects. They call the shade between the 
beginning and end of the shadow “the invisible shadow” (though 
they admit it can be seen when the air is dusty). 


> The representational sense of ‘silhouette’ is etymologically prior to its mereo- 
logical sense. The notorious French controller general of finance, Edenne de Sib 
houette (1709-1767), had a hobby of cutting shadow portraits out of black paper. 
‘Silhouette’ came to be applied to depictions of this sort and then later became ap- 
plied to the causal basis of those depictions. Compare ‘silhouette’ with ‘pin-up’, 
which at first denoted only poster depictions of sexy women and then came to de- 
note the subjects of those depictions. 
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Figure 2 


Shadows are not mere absences of light. Shadows require light 
sources and transparent regions where light would have entered 
were it not for objects that block the source. A brick resting flat on 
the ground at high noon casts no shadow. If I lift the brick, it casts a 
shadow. But I have not discovered a pre-existing shadow. 

Philosophers exacerbate the mystery of shadows by treating the 
shadow as if it consisted solely of the cast shadow. For instance, 
Avrum Stroll* contrasts the three dimensionality of a physical object 
with the two dimensionality of shadows. Whereas two objects cannot 
be the same place at the same time, a two-dimensional shadow can 
be “located in the same portion of space that, say, a stretch of road is 
occupying. So shadows have an existence in space without displacing 
space” (ihid., p. 75). Stroll is led to wonder whether shadows violate 
John Locke’s® principle that two things of the same kind cannot be 
in the same place at the same time. If two people cast shadows that 
exactly overlap, why must there be one shadow rather than two 
(ibid., p. 72)? 


: ee ee Oxford, 1994). 
* Locke states the principle in Essay concerning Human Understanding, Essay I, 
chapter 27, section 1. 
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Puzzlement is diminished by the architect’s conception of the cast 
shadow as a surface of a three-dimensional volume. The two dimen- 
sionality of the cast shadow is due to its status as a surface rather 
than its status as a shadowy thing. Hence the cast shadow has no spe- 
cial property of being in space without displacing space. If an inter- 
loper walks exactly into my invisible shadow, then he does not cast a 
shadow. To cast a shadow, he must block light. 

With the help of separate light sources, two coins can causally 
overdetermine the same cast shadow. If you glue penny A to the cen- 
ter of flashlight A and penny B to the center of flashlight B, then you 
can position the pair of flashlights so that penny A and penny B cast 
the same shadow against a wall. The symmetry of the situation pre- 
vents either flashlight from preempting the other. Since each penny 
generates a distinct volume of shade, their shared cast shadow is like 
the shared section of two intersecting roads. It is also possible for the 
two pennies to share nearly all of their invisible shadows. Just point 
the two flashlights directly at each other. 

The parasitic nature of shadows makes it natural to infer that “a 
shadow is sustained in existence by the continued existence of its 
originating source.” ë But shadows can survive the destruction of 
their originators; consider a tree that is constantly illuminated as it 
petrifies into stone. The shadow begun by the tree is continued by 
the stone. 

It is also possible for a shadow to persist without any sustaining ob- 
ject. Light travels at 299,792,458 meters per second in a vacuum. The 
moon is about 384,400,000 meters away from the earth. Hence, if the 
moon were instantly obliterated during a solar eclipse, its shadow 
would linger for over a second on the surface of the earth. If the moon 
were further away, its shadow could last several minutes. We can extrap- 
olate to posthumous shadows that postdate their objects by millions of 
years. We can also speculate about an infinite past in which a shadow is 
sustained by a beginningless sequence of objects. As one object is de- 
stroyed, it is seamlessly replaced by an object of the same shape and 
size. Arguably, this shadow antedates any object in the sequence and so 
threatens the principle that every shadow is caused by an object.’ 


* Samuel Todes, “Shadows in Knowledge,” m D. Inde and R M. Zaner, eds., Se 
lected Studies in Phenomenology and Existential Philosophy (The Hague: Nijhoff, 1975), 
pp. 95-118, here p. 96. 

” Yet more speculative is Samuel Levey’s scenario involving an infinite sequence 
of walls where wall n has a thickness 1/2" meters. If the sun rises from the thin 
side, which wall casts the shadow at the base of the ee 
scenario is either physically rmpossible or underspecified. What do the photons hit 
if there is no first wall to strike? 
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Do we see silhouetted objects that no longer exist? The answer is 
the same as the solution to the old riddle prompted by starlight that 
survives its star: Can we see stars that no longer exist? Yes, but we see 
them as they were, not as they are. 

Since the silhouette of an object is part of the object, there can be 
cast shadows without silhouettes. Thus the leading edge of a shadow 
need not be a silhouette. (Since an uninterrupted shadow ends at a 
region of space instead of at the surface, it follows that some shadows 
have neither a silhouette nor a cast shadow.) When the object is de- 
stroyed, the leading edge of the dark three-dimensional volume 
races toward its opposite edge at the speed of light. Small volumes of 
shade appear to be destroyed at exactly the time that the object is de- 
stroyed. But, strictly speaking, shadows last longer than their de- 
stroyed objects. 

The speed of light compels a surprising qualification to my answer 
to the eclipse riddle.’ Initially, Near blocks light because there will 
be photons trapped between Far and Near. Once Near absorbs these 
intermediate photons, it will no longer block any light and conse- 
quently become invisible. Only at that stage do we see Far. 

Although cast shadows are displaced and usually distorted, silhou- 
ettes provide a visual match that is comparable to that achieved with 
transmitted light. Seeing a silhouette of an object counts as seeing 
the object itself. ‘This principle is prehistoric and culturally universal. 
Silhouette portraits are the earliest and most universal genre of rep- 
resentational art. Some are found in prehistoric caves. There are an- 
cient specimens from Egypt, China, and Greece. Intricate 
eighteenth-century silhouette portraits are housed in London’s Na- 
tional Portrait Gallery. 

This simplest form of portraiture was a stepping stone to early 
photography. Instead of tracing a shadow by hand, John Herschel 
had nature fix the image with potassium ferricynanide in 1839. 
Botanists would place a plant directly on the photosensitive paper 
and shine a strong light on it. The blue and white image, a precursor 
of the blueprint, was sensitive to the varying opacity of the plant— 
unlike simple silhouettes. Cyanotype photographers were aware that 
the images were of the back sides of the plants; for they warn novices 
to turn the intricate features face down. Further piecemeal improve- 
ments in photochemistry and optics allowed the object’s image to be 
taken without direct contact with the photosensitive paper. Further 
improvements decreased exposure time, increased the durability of 


* I owe this point to Laurie Paul. She notes that the qualtfication 1s not needed 
if Far 1s stahonary 
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the photograph, and formed a continuous developmental sequence 
leading up to modern photography. 

Night photography harkens back to these roots. Nazi naval archives 
contain ominous silhouette photographs of targeted ships. Although 
New York City was out of range of Nazi bombers, cargo ships using 
New York harbor were in range of Nazi submarines. The cargo ships 
turned off their sea lights but were backlit by the bright lights of 
Manhattan. (This explains why New York City had to be blacked out 
during World War I.) The Nazi archives also include silhouette pho- 
tographs of the effects of cargo ships—their wakes, smoke, debris, 
and the like. These photographs of ship effects can be distinguished 
from the photographs of the ships themselves. Consider a submarine 
captain who first spots traces of a ship against the moonlight. Since 
the captain sess that a ship is in the vicinity, he follows its trail until, f 
nally, he sess the ship itself profiled against the moonlight. 

Purely contrastive seeing is not an ability peculiar to homo sapiens. 
Aquatic predators frequently hunt from below. Light enters water from 
above, so prey are back-lit. A shark that approaches a school of anchovies 
from the side will be dazzled by the light reflected by the anchovies’s 
scales as the school sharply pivots in unison. Some sharks counter by dri- 
ving the school toward the surface and then lunging in from below. The 
sharks see the anchovy silhouettes. ‘They also see the anchovies. 

One rationale for counting the sighting of a silhouette as a sight- 
ing of the object might be that the visual match is sufficiently good— 
or as good as obtained during nocturnal conditions when we see 
only in black and white. This fidelity explains why silhouettes are so 
widely used as iconographic symbols. 

As in the case of seeing transmitted objects, the good visual match 
holds only as a general trend. The silhouette of a pile of junk does 
not look much like a pile of junk. But if I see a silhouette of a pile of 
junk, I see a pile of junk. Interviewers preserve the anonymity of in- 
formants by televising them in silhouette. Viewers see the informant 
but cannot identify him on the basis of what they see. 

Under some conditions, a cast shadow provides just as strong a vi 
sual match as the silhouette. Early silhouette portraitists traced the 
sitter’s shadow as it was cast against white paper. Casting a shadow on 
a cloud with a powerful search light makes it seem as if one is seeing 
an object back-lit from the sky above. Hence the “bat signal” from 
the Batman comic strip is called a silhouette. 

An object’s genuine silhouette has a more robust resemblance to the 
object than the object's cast shadow. The cast shadow is only compara- 
ble to the corresponding silhouette under a narrow set of conditions— 
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and only if we ignore the spatial displacement. Causal theorists reject 
the hypothesis that seeing is a matter of having experiences that resem- 
ble the scene. Resemblance is neither a necessary nor a sufficient con- 
dition for seeing. Causal theorists agree that there is generally a good 
visual match, but seek to explain that fidelity in terms of causal connec- 
tions between visual experience and the scene. Why do silhouettes gen- 
erally provide a better match than cast shadows? Why do I see an object 
by seeing its silhouette but not by seeing its cast shadow? 

Note that I am using ‘see’ in Fred Dretske’s® sense of nonepis- 
temic seeing. Whereas seeing that a is F entails belief that a is E, 
nonepistemic seeing lacks belief content (did. p. 88). When cave 
men witnessed a solar eclipse, they saw the moon even if they had no 
beliefs about what they were seeing. 

Or consider the original situation in which I am in doubt about 
whether I am seeing Near or Far. Given my lack of belief, I cannot 
be said to see that the silhouetted object is Near or see that the sil- 
houetted object is Far; for I lack the belief that the object is Near 


and lack the belief that the object is Far. 
IIL CAUSAL PROCESSES AND PSEUDO-PROCESSES 


Wesley Salmon’? draws a distinction that helps the causal theorist ex- 
plain why seeing the silhouette of an object is seeing the object but 
seeing its cast shadow is merely seeing an effect of the object. When 
a blimp casts a shadow that races along the ground, the moving 
shadow has persistence conditions that are entirely parasitic on the 
blimp and the sun. I can predict where the shadow will move by 
keeping my eyes on the ground and tracking the shadow’s speed and 
direction. But the stages of the shadow’s movements are not related 
as cause and effect. Rather, they relate to the stages of the blimp’s 
movements. Elliott Sober’! diagrams the causal structure by using > to 
represent causal connections and broken arrows as the absence of a 
causal connection (ibid., p. 303): 


causal process C0 > QR 3) G >» © > © 
2 x x N 
pseudo-process PI -/-> P2 -/-> P3 -/-> P4 


* Seang and Knowing (Chicago: University Press, 1969), p. 88. Do not picture 
sna temic seeing as restricted to agnostic viewing. Nonepistemuc seeing is com- 
c with being heavily opinionated about what one 1s seeing. Dretske’s point is 
that this sense of ce does not entau any beliefs. 
Explanahon and the Causal Structure of the World (Princeton: University 
Bee 


u “A Plea for Pseudo-Processes,” Pacific Philosophical Quarterly, LXVI (1985): 
303-09, here p 303. 
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If I watch the moving shadow cast by the blimp, then I see a pseudo- 
process governed by the blimp’s movement. But if I watch the silhouw- 
ette of the blimp, I see a causal process. The same point holds if the 
blimp is stationary. 

If the blimp collides with a skyscraper and is destroyed, the 
shadow is terminated. If only the shadow meets the skyscraper, the 
shadow is momentarily deformed but then, after passing the sky 
scraper, continues on exactly as before, carrying no trace of its en- 
counter with skyscraper. I can enlarge the cast shadow by inserting a 
flag into the perimeter of the invisible shadow as the cast shadow 
passes in front of me. But this alteration does not persist after the 
shadow passes the flag. Putting a flag on the blimp does result in a 
lasting change. I can mark the blimp but cannot mark the shadow. 


A given process, whether it be causal or pseudo, has a certain degree of 
uniformity—we may say, somewhat loosely, that it exhibits a certain 
structure. The difference between a causal process and a pseudo- 
process, I am suggesting, is that the causal process transmits its own 
structure, while the pseudo-process does not. The distinction between 
processes that do and those that do not transmit their own structure is 
revealed by the mark criterion. If a proceas—a causal proceas—is trans- 
mitting its own structure, then it will be capable of transmitting certain 
modifications in that structure.” 


The silhouette of the blimp is a causal process because it satisfies the 
mark criterion. A change of the silhouette is inherited by future 
stages of the silhouette. 

Since an intrinsic change in the silhouette constitutes an intrinsic 
change in the object, the silhouette is part of the object. A silhow- 
ette, in this mereological sense, is not a depiction because it exists im 
dependently of any intention to represent the object. 

The moon is like the blimp during a solar eclipse. The silhouette 
of the moon is a causal process. If the silhouette is changed by a me- 
teor collision, then the silhouette retains that disfigurement. The 
shadow cast by the moon is a pseudo-process, however. As it races 
along the surface of the earth, local mterventions have no effect on 
future stages. 

The shadow moves as quickly as the earth spins. In principle, a 
shadow can move faster than the speed of light. Suppose I held my 
thumb over a flashlight that could transmit light across vast dis- 
tances. I wave the flashlight, sending the shadow of my thumb cours- 
ing through the heavens. The rate at which the cast shadow moves 


™ Sahnon, p. 144. 
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increases geometrically with distance. At a large enough distance, my 
thumb’s shadow is moving faster than the speed of light. 

This is compatible with the special theory of relativity.’* The the- 
ory says Only that no signal can travel faster than the speed of light. 
The shadow is not a signal because it cannot transmit information. 
My thumb’s shadow when cast on surface A does not pass on any in- 
formation from A to any subsequent location B. The shadow’s fea- 
tures at Binstead depend on the flashlight and my thumb. 

The silhouette of my thumb can move only as fast as my thumb; 
for the silhouette of my thumb is where my thumb is. The persis- 
tence conditions of a silhouette make it a causal process. A process is 
a persistence of something—a wave, a table, and so on.™ Both the 
object and its silhouette are genuine causal processes. 

Plato’s allegory of the cave was inspired by the shadow plays of 
puppeteers. In the allegory, men are shackled together in a way that 
keeps them facing a cave wall. Behind and above them is a fire and a 
walled walkway. The barrier conceals servants who stroll by with fig- 
urines above their heads. These figurines cast shadows on the cave 
wall. This shadow play is the only reality for the prisoners who have 
never seen things under normal conditions. The prisoners consider 
the shadows as objects in their own right. They do not view them as 
signs of other objects. They do not know that the shadows are cast by 
figurines and so do not use the shadows to make inferences about 
those figurines. When the prisoners view the shadows in this fashion, 
they do not see the figurines. My explanation is that one can see only 
an object via a causal process emanating from that object. The pris- 
oners do not see any objects even though they can predict the pat- 
terns of shadow play. 

If a prisoner is turned toward the silhouettes of the figurines, then 
he sees the figurines—whether he recognizes them or not. Suppose 
the prisoner comes to have normal knowledge of the world but re- 
sumes his original station in front of the shadows. This prisoner 
knows that the figurines are casting the shadows. He can see that the 
present shadow is being cast by the horse figurine. Yet he no more 
sees the figurine than his ignorant companions. He sees exactly what 
his ignorant cavemates see. Since nonepistemic seeing has no belief 
content, what one sees cannot be increased by learning. 

Astronomy teachers warn that staring at an eclipse causes irre- 
versible eye damage. Galileo’s telescope magnified this destructive 


» Milton A. Rothman, “Things That Go Faster Than Light,” Scentific Amencen, 
ccu, 1 July 1960): 142-52. 
4 Salmon, p. 140. 
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power and contributed to his blindness. Hence teachers encourage 
students to view eclipses by means of pinhole cameras. The students 
can safely watch the shadow cast on the ground by the moon as it 
crosses in front of the sun. Teachers assure the students that they are 
seeing the moon eclipse the sun. But the students are actually like 
the enlightened cave prisoner who knows the shadow is cast by an 
object that is, sadly, out of view. Only a student who steals a glance at 


the silhouette in the sky sees the moon eclipsing the sun. 
IV. SILHOUETTES AS ABSORPTION SURFACES 


The silhouette is the surface of the object that makes it visible by 
virtue of the light that it blocks. Therefore, to see a silhouette of an 
object is to see a part of it. Seeing a relevant attached part of an ob- 
ject suffices for seeing the object sympbliciter. The reason why we count 
seeing the silhouette of an object as seeing the object is that we are 
seeing a surface of the object. 

Leonardo Da Vinci propounds a whittling objection to the as- 
sumption that the surface of an object is the outermost material part 
of an object.” If the surface is the outermost part of the object, then 
how can it have any thickness? If it were one millimeter thick, then 
only the outermost 1/2 millimeter could be the outermost part of 
the object. That 1/2 millimeter could not be the outermost part be- 
cause only its outer half could qualify as the outermost part of the 
object. That outer 1/4 millimeter can itself be halved. And so on. 
Consequently, surfaces have 0 thickness and thus cannot be a mater- 
ial part of the object. Da Vinci goes on to infer that the surface is a 
common boundary between an object and its environment. 

The whittling argument illicitly manipulates the domain of dis- 
course for the quantifier ‘outermost’. What counts as the parts of an 
object varies with purposes. Sometimes the parts include a coat of 
paint, the enamel, the peel, and so on. Sometimes not. The domain is 
nondissective; parts of the parts need not be in the domain of dis- 
course. If I say that the surface of my couch is plaid, I am not refuted 
by the observation that none of the small parts of the surface is plaid. 
Da Vinci’s whittling argument can proceed only by shifts of context to 
smaller and smaller parts. Thus, the argument that material objects 
lack surfaces is much like Peter Unger’s’® argument that no material 
object is flat. Both of these arguments are refuted by the kind of con- 
textual analysis David Lewis proposes in “Scorekeeping in a Language 
Game.” 


3 See Avrum Stroll, Surfaces (Minn lis Minnesota UP, 1988), pp. 40-46. 
* Ignorance (New York: Oxford, 1975 
'" Journal of Philosophical Logic, vit (1979): 339-59. 
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Like most philosophers and psychologists, I say that we see objects 
only by seeing their surfaces. Stroll? cites Venus as a counterexam- 
ple: we see Venus even though the surface of Venus is completely 
covered with clouds. I say that we see Venus only if we count the 
clouds of Venus as part of Venus. If we count Venusian clouds as not 
part of Venus, then we do not see Venus. The same holds for objects 
that are marked to make them visible. If I paint part of a perfectly 
transparent ball black, then I see the ball only if the paint is part of 
the ball. 

Stroll denies that Jupiter has any surface at all since it is gaseous. 
Here, I concede that it is vague where the surface is. But the vague- 
ness of the location is compatible with Jupiter having a genuine sur- 
face. Microphysicists regard the surfaces of all objects as vague. 

Stroll rightly observes that philosophers and perceptual psycholo- 
gists have done too little to defend their presupposition that all ob- 
jects have surfaces. And he is right again about their lack of support 
for the presupposition that we see by virtue of these surfaces. But I 
think both presuppositions are correct. Therefore, I rely on both 
presuppositions when assimilating purely contrastive seeing to the 
case of seeing objects that transmit light. But I endorse Stroll’s objec- 
tion to the assumption that we see by virtue of a single surface. 
Purely contrastive seeing shows that there are at least two types of 
surfaces that play a role in vision. 

Surfaces have physical properties. In frontit conditions, we see an 
object by virtue of the light transmitted by its front layer. In back-lit 
conditions, we see the object by virtue of light blocked by its back 
layer. A layer blocks by a different mechanism than it reflects. Conse- 
quently, the surface of a backlit object consists of the portion of the 
object that is just enough to block the light. This absorptive surface 
must be distinguished from the reflective surface. We can represent 
the two surfaces of the far side of the moon like so: )). The outer 
parenthesis is the surface relative to light reflection. The inner 
parenthesis is the surface relative to light absorption. An astronaut in 
orbit over the far side of the moon during a solar eclipse does not 
see the silhouette of the moon even though he sees the reflective 
surface of the far side of the moon. Viewers from earth are seeing 
the absorption surface. 

More precisely, the viewers from earth are looking into a concave, 
bowlshaped surface. It is indistinguishable from a dark disk just as 
the silhouette of a cube (viewed corner on) is indistinguishable from 


" Surfaces, pp. 75-76. 
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the silhouette of a flat hexagram. But we are more apt to interpret 
the silhouette of the moon (in particular, the near side of the moon) 
three dimensionally as a convex surface. Hence the temptation to in- 
fer that the moon or sun has turned black. Most objects are convex 
rather than concave. Hence our strong preference for convexity 
could simply be due to the incorporation of an empirical regularity 
into our visual system. 

Familiar seeing by transmitted light is surprisingly interpretative. 
The image of a square could be produced by an infinite variety of 
trapezoids or tilted rectangles. But silhouettes are much more inter- 
pretive because much less information is conveyed. Perceptual psy- 
chologists are impressed by how many of these physically possible 
interpretations never occur to the viewer. Usually, we promptly com- 
mit to a single interpretation. David Marr” concludes that we disam- 
biguate by hard-wired guidelines: 

Somewhere, buried in the perceptual machinery that can interpret sil 

houettes as three-dimensional shapes, there must lie some source of ad- 

ditional information that constrains us to see silhouettes as we do. 

Probably,...these constraints are general rather than particular and do 

not require a priori knowledge of the viewed shape (thid., p. 219). 


Marr articulates these assumptions as if the lines of sight emanate 
from an illumined object. If each point on the silhouette corresponds 
to one point on the viewed surface, we obtain a curve that serves as a 
contour generator. The three-dimensional shape of the object can be 
inferred, given Marr’s further pair of assumptions that nearby points 
on the contour of the image correspond to nearby points on the ob- 
ject and that all points on the contour generator lie in a single plane. 

Thus, Marr underwrites our tendency to view the silhouette of the 
moon as convex with this rational reconstruction. If he is correct, our 
visual system violates the causal theory of perception. The system as- 
sumes, in effect, that we are seeing a part of the object that is 
causally idle, namely, the front surface of a silhouetted object. This 
hard-wired heresy might contribute to the counterintuitiveness of my 
thesis that we see the backs of silhouetted objects. 

Regardless of whether I am laboring against a neurological illu- 
sion, my thesis that we see the (inner) far side of the moon during 
an eclipse is paradoxical. The general reason is that we tend to 
model vision on cases where we see objects by means of the light 
they transmit. This makes it seem like we are seeing through the 
3,600 kilometers of solid rock between us and the far side. 


% Vinon (San Francisco: Freeman, 1982). 
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To lessen the appearance of a miracle, some people suggest that I 
am only seeing the edge of the moon during a solar eclipse. The 
edge can block light, however, only if it is a physical object rather 
than an abstract mathematical limit. To be visible from earth, that 
edge would have to be many meters wide. If I can see the backside of 
this giant ring of matter, then I would still be seeing a surface that is 
behind several kilometers of solid rock! Reducing the quantitative 
scale of a miracle secures no advantage: “Don’t be so incredulous; Je- 
sus only restored sight to one eye of the blind man.” Half a miracle is 
still a miracle. 

Suppose a florescent balloon meanders in front of a solar eclipse. 
First the glowing balloon blocks our view of the bottom edge of the 
moon, then the middle of the moon. But if the balloon blocks our 
view of the middle of the moon, then we must have previously seen 
the middle of the moon. Therefore, we were not just seeing the edge 
of the moon during the solar eclipse. 

Here is a summary of the basic argument for the thesis that I see 
the entire backside of silhouetted objects. When I see the silhouette 
of an object, I see the object. I see an object only if I see part of it. 
That part must cause my perception in an appropriate way. The only 
part that can play this causal role is the object’s absorption layer. 
Therefore, when I see a silhouetted object, I see its back surface. 

V CONFUSION ABOUT OCCLUSION 
Objects that are seen at least partially in virtue of the light they trans- 
mit can be blocked from view by interrupting their transmission of 
light. But a perfectly dark object does not owe its visibility to the light 
it transmits. Hence, these nontransmitters are an exception to the 
principle that one can always conceal an object by interposing an 
opaque body. We do not need to see through the “blocking” object. 

This point solves a puzzle about cast shadows. The first version was 
devised by Robert Fogelin around 1967 or 1968. It became a topic of 
lunchtime conversations with his colleagues at Yale University: 
Charles B. Daniels, Robert Stalnaker, Richmond Thomason, Samuel 
Todes, and Bas van Fraassen.™ The Yale puzzle and its solution were 
presented in detail by Todes and Daniels.” But I shall present the 


* More oral history. My chief informant, Fogelin, was dissatisfied with a cok 
league's analogy between ‘Only I can feel my pain’ and ‘Only I can cast my 
shadow’ Fogelin devised a complicated eni involving infinitely many shadow 
casters. Fogelin is not sure whether the simplification to two shadow casters was his 
doing, the doing of another individual, or the product of the Yale collective con 
sciousness. 

* “Beyond the Doubt of a Shadow. A Phenomenological and Linguistic Analysus 
of Shadows,” in Ihde and Zaner, eds., pp. 86-93. 
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simplified version van Fraassen mentions in Laws and Symmetry. AL 
though ‘shadow’ is often used as a count noun, it should be read be- 
low as a mass noun (as in ‘How much shadow is there in the 


photographr’): 
L If X casts any shadow, then some light is falling directly on X 


I. X cannot cast shadow through an opaque object. 
IL All shadow is shadow of something (thid., p. 217). 


Imagine a barn casting shadow on a sunny day. A bird flies between 
the barn and the shadow cast on the ground. The shadow directly 
beneath the bird cannot be cast by the bird (by virtue of I). Nor can 
it be cast by the barn (by virtue of II). But no third thing can cast the 
shadow. Hence, II is violated. 

Principle I is true but insinuates that the only way that a shadow 
can appear on the far side of an intervening object is by 
penetration.” It discourages us from considering the alternative that 
the shadow appears on the far side by default. Nothing aside from 
the original blockage of light is needed to place shadow there. 

It is true that (unreflected) light can appear on the far side of an 
intervening object only by penetrating it. But opaque bodies do not 
block shadows. Nor do they let shadows pass through. It is also a mis- 
take to picture a transparent pane of glass as letting a shadow pass 
through. The glass only lets light pass through. But this does not 
mean that glass blocks darkness. Talk of blocking or transmitting 
shadows is a symptom of the fallacy of reification. 

A linguistic philosopher might be tempted to trace the reification 
to the fact that ‘shadow’ is a mass noun even though it has no mass. I 
suspect a stronger influence, however, is the representational econ 
omy of treating shade as if it were colored light. This fiction lets us 
effortlessly extend the projective geometry that is used to depict ilu- 
mined objects to the depiction of shadows. Artists sometimes draw 
with white chalk on dark paper. They represent shadows with un- 
drawn regions. This omissive method of representing shadows 
echoes scientific reality. We normally use light paper, however. To 
depict a dark area, we must shade in with a pencil or some dark sub- 
stance. This practice abets the reification of shadows. 

Darkness is an absence of light, not a substantive force that can be 
stopped with shields. Once the light has been stopped, nothing fur- 
ther needs to be done to ensure darkness. This asymmetry of manip- 
ulation is a general difference between properties and their 


™ New York: Oxford, 1989. 
» Todes and Daniels, p. 90. 
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privations. A homeowner who insulates his house cannot literally 
keep the cold out, he can only keep the heat in. Cold is the privation 
of heat and hence is not a substance that can be directly manipu- 
lated. 

Cold spots, shadows, and holes are not objects.* Therefore, they 
do not challenge the principle that perceivers must have a causal 
connection with the seen objects. To see a hole is to see a perforated 
object. It is not the same as seeing that the object is perforated be- 
cause one might mistakenly believe that one of the perforations is a 
spot of dark paint. We nonepistemically see holes. But this just 
amounts to nonepistemically seeing an object that has a certain 
shape. When I see a dark doorway, I see a holed wall. Behind every 


privation is a substance that suffers the privation. 
VI DARK OBJECTS MUST EFFECT CONTRASTS 


W. R. Brain” regards perception of perfectly dark objects as a coun- 
terexample to the causal theory of perception. But the causal theory 
correctly insists that we need light to see—if not light transmitted by 
the object, then light transmitted by the object’s immediate sur- 
roundings.™ Suppose my only source of light is a window that has 
two shutters. When the left shutter is closed, I see it by virtue of how 
its right side is outlined against the remaining light. When both shut- 
ters are closed, I see neither shutter. 

The main reason for saying that the overshadowed smaller suitcase 
is invisible is that the smaller suitcase does not causally contribute to 
what is seen. The overshadowed suitcase also violates the require- 
ment that S nonepistemically sees O only if O “is visually different- 
ated from its immediate environment by S.” ” Consider a white moth 
on a white tree trunk The white moth causes part of what I see, al 
though not in a way that enables me to discern its outline. I am look- 
ing at the moth but I am seeing it. 

Now consider a perfectly dark moth on a perfectly dark tree trunk. 
This case differs from the cases involving back-lit occlusions. The 
dark moth does not have a silhouette. The scene is front lit. Indeed, 
J am shining a strong light right on the moth. The moth is absorbing 


" The temptation to reify holes can be strengthened with seductive arguments. 
These sophisms are beautifully illustrated and lovingly examined in Roberto Casati 
and Achille Varzi’s Holes and Other Superfiaahties (Cambridge: MIT, 1994). 

= “The Neurological Approach,” in R J. Hirst, Percephorn and the Physical World 
(New York: , 1965). 

™ Richard J. Hall, “Seeing Perfectly Dark Things and the Causal Conditons of 
Seeing,” Theor, 1 (1979): 127-34. 

© Dretkse, p. 20. For cnticism of the visual differentianon condition, see chapter 
four of Gerald Vimon’s Problems with Vision (New York: Oxford, 1997). 
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all the photons that land on it and hence is playing a causal role in 
what I am seeing. For instance, if the dark moth is atop a white spot, 
the moth prevents me from seeing that white spot. Thus, the dark 
moth satisfies the causal theorist’s necessary condition for seeing. - 
The moth is not an idler like Near. 

When the dark moth is on a white tree trunk, the moth is seen by 
the contrast it creates via its light absorption. Absorption is a well-un- 
derstood physical mechanism as evident from black-body research in 
physics. True, the white bark of the tree also contributes by reflect- 
ing light. But seen objects need not make the enkre causal contribu- 
tion. 

Transmission of light is not a sufficient condition for being seen. 
The stars shine as brightly during the day as at night. If you put the 
halves of a ping-pong ball over your eyes, you do not see any objects. 
Transmitters rely just as much on contrast as dark objects. Human 
beings have exquisite contrast sensitivity that strains the capabilities 
of electronic display devices, such as television and computer graph- 
ics displays. People can detect a change of modulation of less than 
one percent across a border.” This extreme sensitivity to differences 
in light intensity explains how the “darkness” of an object can itself 
be a pure contrast effect. Sunspots look dark solely because they are 
relatively cool spots on the sun. They would shine brightly if they 
could be removed from the even hotter photosphere of sun. The 
universality of this reliance on contrast explains why any object can 
be camouflaged by muting its contrast with its surroundings. Seen 
objects normally make a hołistic contribution to the scene. They are 
visible by virtue of the differences that they make with their sur- 
roundings. The differences can be achieved by going up or down the 
scale of light intensity—including all the way down to zero. 

The contrast between the silhouette and its background is muted 
by semi-opaque intermediates, such as window curtains. Since our 
eyelids are only semiopaque, it is possible to see some objects with 
our eyes closed. Close your eyes and position a bright light bulb so 
close to them that you can feel the bulb’s heat. Now wave a pen just 
in front of your eyelids. (Contrast sensitivity is enhanced by move- 
ment.) Seeing the pen’s diffuse silhouette may make you receptive 
to the hypothesis that purely contrast seeing is biologically prior 
(phylogenetically and perhaps ontogenetically) to seeing objects by 
their transmitted light. 


™ Robert Shapley and Dominic Man-Kit Lam, Contrast Sennitutty (Cambridge: 
MIT, 1993), p. xii. 
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All evolutionary accounts start with a proto-eye that is merely senst 
tive to the presence of light and then develops into a detector of 
passing shadows. The first seen objects should have been silhouetted 
figures. Only later would we expect objects to be seen by virtue of 
the light they transmit. Instead of being a marginal form of vision, 
purely contrastive seeing is the primal form of seeing. Stereotypical 
seeing is an elaboration of this more basic ability. 

Black holes amass the immediate surroundings that make them- 
selves discernible. The textbook scenario features a black hole that 
develops from a large star in a binary star system. The larger star 
has been pulling matter from the smaller star. After the large star 
accumulates a critical amount of matter, it collapses into a black 
hole. Since the black hole has the same mass as it had when a large 
star, it continues to pull matter from its partner star. Since this new 
matter conserves angular momentum, it forms an accretion disk 
that revolves around the black hole. The matter from the inner 
part of the disk accelerates up to nearly the speed of light. The fric- 
tion creates tremendous heat which in turn emits X-rays. Hence 
the black hole makes itself visible by piecing together an environ- 
ment for itself. 

Goldman rightly denies that there is a “transfer of energy or force” 
from the black hole to the observer (ibid., p. 282). But he wrongly 
denies that there is a physical mechanism by the which the black 
hole makes itself seen. The black hole causes other things to transfer 
energy to observers. During World War I bombing raids, high-flying 
bombers could see the silhouettes of low-flying bombers below: **. 
The low bombers were backlit from enemy searchlights. The low4fly- 
ing bombers were not directly transferring energy into the eyes of 
high-flying observers. But the lowflying bombers were still causing 
themselves to be visible by rousing the enemy below. 

Dark objects are a varied lot. Nevertheless, neither the overshad- 
owed suitcase nor the overshadowed heavenly body, Near, satisfy the 
causal requirement. True, I would see Near if Far vanished. But I see 
in virtue of actual causation, not hypothetical causation. Recall 
Harry Frankfurt’s™ counterexample to the principle that I am re- 
sponsible only if I could have done otherwise. An evil scientist has 
rigged up a device that will make me do an evil deed if and only if I 
fail to do it by my own accord. I do the evil deed, so the device does 
nothing. I am responsible for the evil deed even though I could not 
have done otherwise. 


» “Alternate Poasibilines and Moral Responsibility,” this JOURNAL, LXVI, 23 (De- 
cember 4, 1969): 829-39. 
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Compare Frankfurt’s case with Lewis’s™® censor. The censor is a 
device that will make me have the visual experience of a scene if and 
only if I do not have the visual experience in another way. I am hav 
ing a visual experience of the scene by opening my eyes and receiv- 
ing reflected light waves in the normal way. Am I seeing the scene? 

Lewis says I see if and only if “the scene before my eyes causes 
matching visual experience as part of a suitable pattern of counter- 
factual dependence” (ibid., p. 285). The censor illustrates a lack of 
suitable dependence: 


The case is one of causal preemption. The scene before my eyes is the 
actual cause of my visual experience: the censor is an alternative poten- 
tial cause of the same effect. The actual cause preempts the potential 
cause, stopping the alternative causal chain that would have otherwise 
gone to completion (thd, pp. 285-86). 


Substitute ‘Near’ for ‘the censor’. Although the censor is idle, its 
“idleness is an essential factor in the causal process by which match- 
ing visual experience is produced.... We cannot uniformly ignore or 
hold fixed those causal factors which are absences of intervention” 
(thid., p. 286). Lewis concludes that I do not see the scene. 

Lewis's verdict has been unpopular. His personal envoy, Bruce Le 
Catt’! has tinkered with the counterfactual dependency theory to ik 
luminate the possibility of securing a better match with intuition. 
But these changes do not address the main problem: underweight- 
ing the importance of physical mechanisms. As Brian McLaughlin™ 
stresses, I see because the censor is an external faibsafe device that 
does not prevent me from exercising my capacity to see. 

Let me turn the topic temporarily from nonepistemic seeing to 
perceptual knowledge. The eclipse riddle does not fit into the tradt 
tion of skeptical counterpossibilities. It is not like cases involving 
dreams, brains in vats, or fake barns.™ I have access to all the rele- 
vant facts. When I observe Near and Far approach and then inter- 
sect, I know they are behaving just as astronomers predicted many 
years ago. My situation differs from that of a passerby who had only 
heard of Near and just happens to look up at the moment of inter- 


» “Veridical Hallucination and Prosthetic Vision,” in Phelosophscal Papers, Volume 2 
(New York: Oxford, 1986), pp. 278-86; bere p. 285. Originally published in Aus- 
tralanan Journal of Pht LVE (1980): 239-49. 

* “Censored Vision,” Australasian Journal of Philosophy, LX (1982): 158-62. 

™ “Lewis on What Distinguishes Percepton from Hallucination,” in Kathleen 
Akins, edL, Pl cna RW Nie Oxford, 1996), pp. 198-231. 

* Goldman, “Di ination and Perceptual Knowledge,” this JOURNAL, LXXM, 20 
(November 18, 1976): 771-91. 
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section. He might be surprised to learn that the dark shape might be 
caused by Far. Not me; I have been preparing for the event for years 
and know all the empirically relevant facts. 

The only threat to my perceptual knowledge of Far is conceptual 
unclarity. If I fail to believe that I am seeing Far, then I do not 
know that I am seeing Far. Astronomers saw dark nebulae long be- 
fore they had perceptual knowledge that dark nebulae are clouds 
of interstellar dust. They had surprising difficulty excluding the 
possibility that the dark regions in the sky are tunnels through 
which we look out beyond the stars of the Milky Way into inter- 
galactic space. 

VIL CAUSE, RESEMBLANCE, AND VISUAL DETECTION 

We are tempted to say we see Near because we seem to see its outline 
(and so appear to satisfy Dretske’s condition of visual differentia- 
tion). The influence of the outline can be gauged by considering a 
scenario that increases the relative size of Far. The huge shadow 
would make Near invisible. Those tracking the movement of Near 
would say that they had lost sight of Near as it is submerged into the 
massive umbra of Far. 

The extent of overshadowing is irrelevant to occlusion as long as 
the entire object is covered. But the size of the shadow is relevant to 
visual detection. A thief who hides in a shadow that perfectly coin- 
cides with his shape is not as well concealed as one who enjoys cover- 
age by a massive shadow that is not congruent with his shape. 

When Near and Far have their original dimensions, the trackers of 
Near would be able to see that Near is aligned with Far even though 
they cannot see Near. To see an object, the object must be causally 
responsible for the visual information. This requirement explains 
why I do not see Mario Andretti when looking at his identical twin 
Aldo Andretti. A photograph of Aldo Andretti is not a photograph 
of Mario Andretti, even if their resemblance is exact. The as- 
tronomers are seeing an object that has the same look as Near. This 
resemblance ensures that what they see gives them all the visual in- 
formation about Near that would have been available had Near 
caused the image itself. Hence, the astronomers tracking Near are 
reliably guided by the resemblance just as a reporter tracking Mario 
Andretti is reliably guided by a photograph of his identical twin Aldo 
Andretti. 

Visual detection does not suffice for seeing. Astronomers first visu- 
ally detected planets outside our solar system by observing perturba- 
tions of stars through ground telescopes. Only later, with the help of 
the Hubble telescope, did they see a planet outside our solar system. 
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VII THE COLLAPSE OF THE COMPOSITE ANSWER 

The Far answer is unaffected by the third possibility that I am seeing 
a composite object whose parts are Near and Far. This mereological 
sum, Near+ Far, is arbitrary but no more arbitrary than scattered ob- 
jects, such as constellations. We do speak of seeing the moon even 
when we do not see the far side of moon. Seeing a relevant, properly 
attached part of an object counts as seeing the object simpliciter.* 
Seeing behaves just like most transitive verbs in this respect; to 
scratch a relevant attached part of an object is to scratch the object 
itself. Near is the front of Near+Far and Far is its back. If I see the 
front of Near+Far, then I see Near. But then the simple answer, that 
I see Near, would also be correct. The appeal to mereological sums 
would also be otiose if the simple ‘I see Far’ answer is correct. For if I 
see the back of Near+Far, I see Far. The composite answer is correct 
only if one of the simple answers is correct. 

A polarizing light filter casts no shadow. But if a second filter of op- 
posite polarity is placed beneath, then the pair jointly cast a shadow. 
Someone viewing an eclipse involving two such filters might qualify as 
seeing their mereological sum. Both the near filter and the far filter 
would each contribute to what is seen. In contrast, the heavenly body 
Near is completely idle. The same phenomenon arises for some ob- 
jects that reflect ight. Glass mirrors also polarize light Hence glass 
mirrors darken when viewed through a polarizing filter even though 
everything else in the scene looks perfectly ordinary. The dark mir- 
rors are visible by virtue of their contrast with their surroundings. But 


unlike Near, the mirrors causally contribute to the scene. 
IX VISUAL DIFFERENTIATION AND THE FAR SIDE 


Picture Near and Far coming closer together until they fuse. I still 
see Far. This reinforces the point that we see the backs of back-lit 
dark objects. When the moon eclipses the sun, only the far side of 
the moon blocks the light. The front half is in the shade. Thus we 
see the far side of the moon during a solar eclipse. 

True, little of the far side can be differentiated by purely con- 
trastive seeing. Only the outer rim offers positive detail. The middle 
only supplies the negative information that there are no tunnels run- 
ning completely through the moon. The silhouette of a fine mesh 
screen has a much higher proportion of positively detailed surface. 

We distinguish between observing an effect of an object and seeing 
the object. The star Algol, in Perseus, has a companion star that is too 
dim to be seen. Since the plane on which these two stars revolve is ort 


” McLauglin, “Perception, Causation, and Supervenience,” Midwest Studies m Ph+ 
lasophy, IX (1984): 569-92, here pp. 580-85. 
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ented nearly edge on to our line of sight, each star is eclipsed by the 
other during every revolution. Although the effect of Algol’s companion 
can be discerned by how it periodically dims Algol’s light, astronomers 
describe Algol’s companion as invisible. Astronomers cannot even dis- 
cern its outline. The object must look some way to the perceiver.” 

The analogy with Algol’s companion suggests that during a solar 
eclipse, we see an effect of the moon but not the moon. The distinc- 
tion between seeing and seeing effects can also be drawn in another 
way that appears to yield the common-sense answer that we see the 
moon but not the far side of moon. “Seeing the search beam of a 
lighthouse may count as seeing the lighthouse and not as seeing the 
bulb inside the searchlight.” The suggestion is that the moon 
stands to its far side as the lighthouse stands to its bulb. 

The far side is not as obscured as the bulb, however. Aristotle ar- 
gued that earth must be round because it always casts a round shadow 
on the moon during lunar eclipses. We have doubts about whether we 
see the earth this way because its shadow is displaced; the earth’s 
shadow is on the surface of the moon but the earth is not. Similar 
reservations affect seeing objects by mirror reflection. But during a so- 
lar eclipse, the silhouette of the moon is not displaced. The informa- 
tion we get from the moon's silhouette is normally “discounted” by 
our ample background knowledge of the moon. The empirical cor 
tent of the silhouetted moon is salient, however, when the observers 
have unusual background beliefs. Suppose we are worried that the 
moon was struck by a giant meteor during the daytime when it is invis- 
ible. Instead of waiting until night to find out, we fortunately have 
the opportunity to check the moon’s condition by observing an af- 
ternoon eclipse: “What a reliefl There the moon is, right on sched- 
ule, just where it belongs, entirely intact. The meteor must have 
missed!” 

Actually, the detail from the far side during an eclipse can be 
greater than that offered by the moon’s reflected light during a hazy 
evening. In an 1836 eclipse, Francis Baily noticed some beac-like fea- 
tures along one side of the moon. “Baily’s beads” are the peaks of 
mountains that lie along the profile of the moon. If the moon re- 
volved with respect to Baily and if the eclipse lasted long enough to 
complete one revolution, Baily could have surveyed all of the moon's 
peaks. But the moon always presents the same side to us. Whyr Be- 
cause the near side bulges about .5 kilometers and so is pulled more 
strongly by the earth’s gravity. If a meteor had knocked off this bulgy 


* Dretake, p. 20. 
= “Perception, Causation, and Supervenience,” p. 584. 
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excess, the moon might have been sent spinning fast enough to 
make silhouette astronomy informative. 

Granted, even Baily’s hypothetical survey hardly compares to the 
rich visual information relayed by the spacecraft, Luna II, in 1959. 
But even the proud Soviet astronomers granted that, under certain 
conditions, we very dimly see a portion of the “dark” side of the 
moon by earthlight (a phenomenon popularly known as “the old 
moon in the new moon’s arms”). 

When I see the moon in normal conditions at night, I see the 
moon by virtue of seeing its front. Sticklers deny that I see the moon; 
I see only the front surface. The same sticklers would say that on my 
account, we do not see the moon when it eclipses the sun; we see 
only the back surface. J admire the insight and attention to detail 
that motivates the stickler. But J am not a stickler. I say we see the 
moon during normal conditions at night and that we see the moon 
during an eclipse. But I need insist only that the stickler’s standards 
be applied uniformly to both cases. 

Suppose Near and Far have the same intrinsic diameter. Since Far is 
further away, it then only prevents light from striking the middle of 
Near. The stickler will say that I see a “doughnut” composed of an 
outer ring of Near and an inner core of Far. It is doubtful, however, 
that I see Far in this circumstance. I cannot differentiate Far from its 
immediate environment. Do I see a little star if it is stationed in front of 
the sun? If the two are equally lummescent, I cannot differentiate the 
little star from its immediate environment. Suppose a wall has beige 
wallpaper. If someone neatly pastes a patch of leftover beige wallpaper 
on the wall, do I see it? Following Dretske (op. at, p. 23), I answer no. 

I can clearly discern the outline of Near. True, I do not see its center 
because it is blocked by Far. But the block is not the sort that makes a 
difference to the visual match with the scene; if Far were absent, the 
scene would look the same. Since J am a nonstickler, I say that in that 
circumstance, I see Near because I see a relevant, attached part of it. 

Parallel, duplicate blockers are unexpectedly common in our car- 
pentered world of rectangular volumes. Consider a silhouetted box. 
Although its front side has the same dimensions as its back side, the 
front side is farther from the light source. Hence the shadow gener- 
ated by the back side fails to include the outermost edge of the 
front. Consequently, perimeter of the silhouette is actually caused by 
the front side of the box. The stickler concludes we see a rectangular 
doughnut. But since the outline of the back side is invisible, the back 
side fails to satisfy the visual differentiation condition. I conclude we 
see the front of the silhouetted box. (More precisely, the relevant ab- 
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sorption layer starts from the inside of the front side.) Boxy shapes 
are rare in nature. Nevertheless, they constitute an important excep- 
tion to the rule that we see the backs of silhouetted objects. 

If I am standing with my nose against a concrete wall, then I see 
the wall or at least a portion of it. Yet I am not visually differentiating 
the wall from its immediate environment. Dretske treats this anom- 
aly as a limiting case (op. cit, p. 26). He reasons that since there is no 
immediate environment, the requirement of visual differentiation 
does not apply. Interestingly, there is no parallel anomaly for the vr 
sual differentiation requirement in the case of purely contrastive see- 
ing. If I am hurtling toward the moon during a solar eclipse, the 
black disk grows bigger and bigger, and then eventually overflows my 
visual field. There is only darkness. I know the moon is directly 


ahead of me. But I do not see it. For I need light to see. 
X. THE CAUSAL THEORY’S RECOVERY OF PRIMAL VISION 


I now summarize my main theses. Instead of adjusting principles to fit 
the intuition that I see Near, we ought to disown this piece of common 
sense as the effect of an overgeneralization. The causal theory of per- 
ception unexpectedly but correctly rules out the possibility that we see 
Near. Its support for Far forces us to recognize that, typically, we see 
the backs of dark, backlit objects, not their fronts. So ‘occlude’ must be 
relativized. That forces a corresponding relativization of ‘field of vr 
sion’. Thus the causal theory of perception brings an ancient species of 
seeing to the surface. Purely contrastive seeing is a living fossil that has 
everyday seeing as a descendent. Some minor adjustments are needed 
to accommodate this recovered form of seeing. Our conception of the 
physical mechanisms that support vision must be broadened to encom- 
pass the various kinds of visible dark objects. To include back-lit ob- 
jects, we must allow partial light blockage to count. Objects are not seen 
simply by virtue of a contrast with their environment. They must cause 
the contrast. That is the difference between Near and Far. 

Intersecting eclipses can form from a fortuitous convergence of 
heavenly bodies. But they also form in a cyclical fashion. As the 
moon orbits earth, the earth eclipses the sun. The scene would be 
reminiscent of the famous photograph “Earthrise” taken from the 
vantage point of the moon. Now take a giant step back from moon 
by imagining how the scene looks to an astronaut in a high orbit 
around the moon. The earth is then aligned with the moon and the 
astronaut has both of them in his line of sight. Near is the moon. Far 
is the earth. 

ROY SORENSEN 
New York University 


50 THE JOURNAL OF PHILOSOPHY 


BOOK REVIEWS 


The Logic of Provabikty. GEORGE BOOLOS. New York: Cambridge Univer- 
sity Press, 1993. xxxvi + 275 p. Cloth $69.95, paper $24.95. 


This is a revised and expanded version of the late George Boolos’s 
The Unprovability of Consistency: An Essay în Modal Logic.’ Like its pre- 
decessor, it exploits very well the opportunity to present a very attrac- 
tive little theory in modal logic in terms accessible to a wide class of 
readers with some knowledge of first-order predicate logic. (Chap- 
ters thirteen and fourteen demand a bit more, but can be skipped 
without other loss.) 

A thirty-six page introduction offers a survey of the content of the 
book. For another excellent summary, in the context of some history 
of the field and especially of the author’s contribution to it, see the 
Afterward, written by John Burgess, in the recently-published collec- 
tion of Boolos’s essays.* 

The first twelve chapters, covering roughly the subject matter of 
the earlier book, along with chapters thirteen and fourteen, concern 
certain systems of modal propositional logic (MPL). The formulas of 
MPL are built up from sentence letters p, q, t, and the like, and the con- 
stant L (absurdity) using the connectives > (implication) and L] (ne- 
cessity). For example, the negation “A can be taken as an 
abbreviation for Á > L. A system of MPL is called normal if its set of 
theorems is closed under modus ponens, necessitation (that is, if A is 
a theorem, then so is LJ A) and substitution of arbitrary formulas for 
sentence letters, and includes all tautologies and all instances of the 
distributive laws L] (A > B) ~ (OA CIB). The least normal system is 
denoted by K. Chapter one is devoted to the relative strengths of 
some natural normal systems. 

Chapter four develops Kripke semantics for normal systems. A 
frame is a pair <W, R>, where Wis a set (of “possible worlds”) and Ris 
a binary relation on W. (wRw’ means that, from the perspective of 
world w, world w’ is possible.) A model M consists of a frame <W, R> 
together with a relation V between worlds in Wand sentence letters. 
Relative to M, the notion that a formula of MPL is true in a world w 
E Wis defined as follows: (i) the sentence letter p is true in w if and 


1 New York: Cambridge, 1979. 
t Logic, Logic, and Logic: Tha Collected of George Boolos, with Introductions 
and Afterward by Jobn Burgess, Richard J , ed. (Cambridge: Harvard, 1998). 
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only if wVp. (ii) L is not true in w (iii) A > Bis true in w if and only 
if either A is not true in wor Bis true in w. (iv) UA is true in wif and 
only if ais true in w for every w such that wRw. Call A valid in M if 
and only if it is true in every world relative to M, call it valid in a 
given frame if and only if it is valid in every model with that frame, 
and call it simply valid if and only if it is valid in all frames. Given a 
model M and a world w of M, if the premises of modus ponens are 
true in w, then so is the conclusion. If A is true in every world in M, 
then so is LJA and, if A is valid in a frame, then so is each of its sub- 
stitution instances. In particular, every theorem of Kis valid. Many of 
the standard normal systems, initially picked out because they corre- 
sponded to some intuition about what “necessity” might mean, turn 
out to be characterized by some simple property of the relation R, in 
the sense that their theorems are precisely the formulas of MPL 
which are valid in all frames (called the frames appropriate to the sys- 
tem in question) in which the relation R has this property (chapter 
five). For example, corresponding to the system T obtained from K 
by adding as axioms all instances of [JA —> A is the property that R 
should be reflexive on W. Chapter six gives a construction of a canon- 
ical model for each consistent normal system L, that is, a model in 
which the valid formulas are precisely the theorems of L. 

In provability logic, CJA is intended to mean that the formula A “is 
provable.” To make this precise, a particular formal system must be 
specified in which it is to be proved. The simplest case, treated in the 
most detail, is the system of so-called Peano Arithmetic (PA). If the re- 
alization™ assigns to each atomic formula letter p of MPL a sentence 
p* of PA, it extends uniquely to all formulas of MPL by taking L* to 
be L itself, (A > B)* to be A* > B* and (DA)* to be Bew (‘A*’), 
where ‘B' is the Godel number of B and Bew (x) is the standard 
arithmetical formula expressing that x is the Godel number of a for- 
mula which is provable in PA. To use Burgess’s terminology, we call 
a formula A of MPL apodictic if and only if A* is provable in PA for all 
* and we call it veracious if and only if A* is true for all *. 

The necessary information about PA and its provability predicate 
is outlined in chapter two and some elementary facts about the inter- 
pretations * are in chapter three. In particular, it is shown there that 
every theorem of the normal modal system (GL, named for Kurt 
Gödel and M. H. Lob) is apodictic and that every theorem of the 
normal modal system (GLS, ‘S’ for Robert Solovay) is veracious. 
Chapter nine contains Solovay’s proof that the apodictic formulas 
and the veracious formulas are precisely the theorems of GL and 
GLS, respectively. It uses the fact (proved in chapter five) that the 
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frames appropriate to GL are those in which R is transitive and there 
are no infinite sequences w, Rw, Rw,.... There are analogous results 
involving other normal systems of provability logic, again due to 
Solovay, where, instead of PA, the proof predicate is for Zermelo- 
Fraenkel set theory (chapter thirteen) and second-order number 
theory—so-called ‘analysis’ (chapter fourteen). 

Chapter eight contains three proofs of the Fixed Point Theorem, 
which expresses in modal logical terms the slight-of-hand aspect of 
Godel’s theory of undecidable sentences: let A(p) be a formula of 
MPL and assume that all occurences of the letter pin A(p) are within 
the scope of []—so A(p) says nothing directly about p, only about 
the provability of sentences containing it. Then we can effectively 
find a formula B of MPL containing no sentence letter that is not in 
A and not containing p such that, writing CJ A for DIAN A: 


Li (pe A) eO (pe B) 


is a theorem of GL. For example, one algorithm for constructing B 
from A, together with Solovay’s theorem, yields, with S any sentence 
of PA and + meaning provable in PA: 


H Se~ Bew('S") ifand only if Se = Beat" 1") 
and: 
L Se Bew" S’) ifand only ft Semn L 


Chapters fifteen and sixteen discuss bimodal logic, obtained by in- 
troducing a second modal operator [1]. The operation * assigning 
sentences of PA to sentence letters is extended to all bimodal for- 
mulas by taking (HJA)* = wBew("A*'), where wBew(x) expresses the 
property of being provable in PA with one application of the w-rule. 
Again, there is a system GLB of bimodal logic whose theorems are 
precisely the apodictic formulas, although the proof in this case 
cannot proceed from a simple Kripke semantics for the system. 

The final two chapters, seventeen-cighteen, treat quantified prov- 
ability logic. Here, instead of the formulas of MPL, we have the for- 
mulas of QML, built up as usual in first-order predicate logic, except 
that we admit, given the formula A, also the formula CJA. As in the 
propositional case, we can introduce the notion of a realization * for 
this system, namely, an assignment to each predicate letter P of n 
variables of a formula P* all of whose free variables are among the 
first n in a fixed list of the free variables. * is uniquely extended to as 
sign to each formula A of QML a formula A* of PL with the same 
free variables. In this case, however, neither the class of apodictic 
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formulas nor the class of veracious formulas forms a finitely (or re- 
cursively enumerably) axiomatized system. 

I used the book as text for a ten-week course on provability 
(propositional) logic for a class of both graduate philosophy stw- 
dents and undergraduates, some but not all of whom had already 
had a course covering Godel’s incompleteness theorems. The book 
was at good level for them, although the students who had not had 
previous exposure to the arithmetization of syntax found chapter 
two hard going. There is a sprinkling of good exercises, but they 
probably ought to be supplemented in such a course. 

A list of typos: page xxiii, line 27, “n” for “n” and on line, “)x” for 
“x)”"; page 12, Theorem 22, ~.. for the first occurence of L; page 14, 
line 6, delete “by Theorem 21 (with L for p),” line 7, “L” for the last 
occurence of “p,” line 8, “1” for the last two occurences of “p,” lines 
9 and 10,“L” for “p”; page 19, line 1, replace “(non-logical axioms)” 
by “(norlogical) axioms”; page 23, line 5, “F(x)” for “Fx”; page 31, 
line 26, “+” for “i”; page 44, line 6, “Yngi” for “syy1°; page 45, line 
7, “y” for “var(y)"; page 47, lines 35-36, the case in which F is the 
negation of one of these equations also has to be treated; page 51, 
line 13, “realizations” for “interpretations”; page 63, line 14, “incon- 
sistent” for “1-consistent.” 

WwW. W. TAIT 
University of Chicago 
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MEMO TO AUTHORS 


The editors cannot afford to send to the printer manuscripts that are 
difficult for the compositor to read or footnotes that need to be 
completed in proof. They ask that manuscripts submitted be pre- 


pared according to the following guidelines: 


(1) All material, including the quotations and footnotes—especially 
footnotes—muast be typed, double-spaced, on one side of the paper, 
with ample margins left and right, top and bottom. 

(2) No bibliographies will be published; bibliographical material 
should be put into the footnotes. 

(3) Footnotes should be complete, according to Journal style (see any 
issue). Journal citations should include name of journal in full, vol- 
ume number, issue number, month or season, year, and pages. 
Book references should include publisher, aty of publication, and 
date. Footnotes should be gathered at the end of the article. 

(4) Quotations should be checked by the author; capitalization, italics, 
spelling, and the like should match the onginal. Corrections should 
not have to be made (expensively) in proof. 

(5) If there are diagrams, they should be provided camera-ready by the 
author. If there are special symbols, they should be widely spaced 
and clearly marked; if there are many, a list should be provided. 
Standard logic symbols are available, and letters of the Greek and 
script alphabets; no boldface. 

(6) Computer copy can be extremely difficult to read; only letter- 
quality is acceptable. It should be double-spaced and not justified. The 
print should be dark enough and the letters widely enough spaced 
so that both letters and punctuation marks can be read easily. 


The editors are willing to read and evaluate manuscripts that do not 
meet these standards, provided they are genuinely legible and not 
unreasonably long. They may ask for revision of excessively long 
manuscripts, and they require reformatting of accepted manuscripts, 
if necessary, before scheduling for publication. 


The Journal of Philosophy is published monthly, in issues that (ordi- 
narily) contain only 56 pages of copy; this limits the length of papers 
the editors can normally publish to no more than 7,500 words, or 
eighteen Journal pages. A more precise way of determining length is 
by characters: 2,850 characters per Journal page, or 51,300 charac- 
ters per Journal article of 18 pages. 


The Journal of Philosophy does not publish papers that are primarily 
historical or expository, the editors urge that such manuscripts be 
submitted elsewhere. 
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rust enters into the making of a virtuous person in at least two 

ways. First, unless a child has a sufficiently trusting relationship 

with at least one adult, it is doubtful that she will be able to be- 
come the kind of person who can form ethically responsible relation- 
ships with others. Infant trust, as Annette Baier’ has reminded us, is 
the foundation on which future trust relationships will be built and 
when such trust is irreparably shaken, the adult into whom the child 
grows may be forever cut off from intimacy. Second, a moral beginner 
must trust other people’s moral judgment while she learns to be good. 
A child does not learn how to be good on her own. She follows the 
precepts and examples of others, absorbs the moral values of the com- 
munity into which she has been born, and relies on others who, with 
luck, will not lead her astray but will turn out to be trustworthy in of 
fering her guidance and correction. While there is some empirical evi 
dence that children learn about morality better when given reasons in 
support of what they are being taught, part of what a child is being 
taught is what counts as a good reason. Thus, no moral beginner could 
be in a position to assess adequately the cogency of the reasons she is 
given until she has progressed some distance in her education.’ Until 
that time, the beginner must rely on other people’s judgment. 


* This paper was written while I was a fellow at Cornell University’s Society for 
the Humanities; I would hke to thank the Society for 1ts support The paper was pre- 
sented at Different Voces: A Conference m Honor of Annatie Baier, held at the University 
of Pittsburgh ın 1995, and to audiences at Cornell, the Unrvernty of Cincinnaty, and 
the 1996 Australasian Association of Philosophy Annual Conference—each audi- 
ence contributed to the final shape of the paper Special thanks to Jennifer Whit 
ing, Lisa Rivera, and Donald Ainslie for written comments on earher vermons. 

' “Trust and Anti-trust,” Ethics, xcvi (1986) 231-60. 

* Some accounts of moral education (for example, Anstotle’s) will give a greater 
role to trust than others do 
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That trust enters into the making of a morally good person should 
thus be uncontroversial. I want to defend a much more controversial 
thesis: needing to trust other people’s moral judgment is not just a 
stage we go through on the way to becoming morally virtuous. If the 
world of value is complex, and if our access to it is shaped by our expe- 
riences, then even among the morally mature there will continue to 
be a significant role for moral testimony and thus for trust. We should 
not think of our reliance on other people’s moral judgment as like 
training wheels for the novice cyclist, wheels which can and ought to 
be dispensed with, once we have the knowhow and the confidence to 
ride off on our own. Often, we cannot do as well on our own as we 
could do if we accepted the moral testimony of others. Moreover, just 
as borrowing scientific knowledge can enhance our capacity to dis- 
cover truths about the nonmoral world, borrowing moral knowledge 
can enhance our capacity to understand the world of value.’ 

I am going to start by giving a sketch of what I take to be the best ar- 
guments against testimony as a source of moral knowledge. Next, I 
give a detailed reallife example which aims to make us suspicious of 
the case against testimony. At the same time, the example tries to be 
fair to the considerations against testimony and to show why we might 
find morally attractive someone who resists such borrowing and re- 
fuses such trusting. Armed with the example, I then return to reexam- 
ine the initial case against moral testimony. In the final section, I 
argue that making room for moral testimony brings issues of trust to 
center stage. The wise truster of moral testimony begins from a default 
stance of distrust, yet can come to trust and, through trusting, can 
come to have moral knowledge that would otherwise be inaccessible 


to her. 
I. THE CASE AGAINST TESTIMONY IN MORAL KNOWLEDGE 


It has seemed simply obvious to some, such as Robert Paul Wolff,‘ 
that there is no place for testimony about moral matters: 


* Throughout this paper, I am going to speak of “moral knowledge” and some- 
times even of “moral facts,” but this is simply for convenience of expression While 
I do think that only a realist account of morality can adequately accommodate our 
moral sano Baye it js no part of my brief to argue for that here Quastrealists (for 

Simon Blackburn, in Spreading the Word (New York. Oxford, 1984)) and 
norm eet (for example, Allan Gibbard, in Wise Choices, Aft Feettngs (Cam 
bridge: Harvard, 1990)) can each offer nonrealist reconstructions of this sort of 
talk Only on a very simple subjectvist view will such talk be ilHcit. And I take it— 
admittedly without argument—that such a view is no longer a contender. 

1 In Defense of Anarchism (New York: Harper, 1970); also cited ın Gerald Dworkin, 
The Theory and Practycs of Autonomy (New York: Cambridge, 1988), p. 52 John Hard- 
wig shows how Wolff 1s wrong about contemporary mathematics, given the prolifer- 
ation of mathematical specialties; see his “The Role of Trust in Knowledge,” this 
JOURNAL, LXXXVII, 12 (December 1991): 698-708. 
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He [the responsible man] may learn from others about his moral obliga- 
tions, but only in the sense that a mathematician learns from other 
mathematicians—namely by hearing from them arguments whose valid- 
ity he recognizes even though he did not think of them himself. He does 
not learn in the sense that one learns from an explorer, by accepting as 
true his accounts of things one cannot see for oneself (sd, p. 13). 


We need to rely on the word of an explorer returning from strange 
foreign lands because we cannot have exactly his reasons for believ- 
ing what he says. “We saw it with our own eyes’ cannot be our reason 
for believing what he says. Unless we go there ourselves, we must rely 
on his telling us what he saw. 

With morals, it might seem to be otherwise: we give reasons for our 
moral views and these reasons can be shared and so can be evaluated 
by each of us.° As a crude first approximation, to be refined later, tes- 
tifpng that p contrasts with arguing that p insofar as it is the testifier 
herself who vouches for the truth of ~; someone who argues that p 
lets the arguments vouch for themselves.’ So long as we can give ar- 
guments for our moral positions, it seems that we do not ourselves 
have to vouch for the truth of those positions. On this view, moral 
testimony can have a significant place only for beginners who cannot 
manage to evaluate the arguments. For the rest of us, moral test 
mony can be useful at most only when we do not have enough time 
to give careful consideration to all the arguments relevant to a moral 
position. We might, then, because of time constraints, rely on the 
moral testimony of those whose character we trust and who we know 
have been able to give careful consideration to the arguments. 

Three additional considerations seem to roll back even further the 
place for testimony about moral matters. 

First, moral matters are important and engage our responsibility 
in a way that nonmoral matters do not. We are responsible for avoid- 
ing errors here in a way that we are not responsible for avoiding 
them in other areas. Further, moral judgments reflect on our charac- 
ter in ways that other judgments do not. The extra importance at- 
tached to moral judgments derives both from their close connection 


*C A J. Coady mentons this as an important consideration against testimony 
regarding moral matters; see his Testimony: A Paslosophacal Study (New York Oxford, 
1992), p. 71. 

* This account of testifying is not quite the same as that offered by Coady. He 
builds speaker competence into his definition of tesufying (op at, p 43). Ehze 
beth Fricker nghtly takes him to task for this, since it shifts the interesting episte- 
mological questions to the question of whether someone was in fact testifying that 
p—“Telling and Trusting: Reductionism and Antereductionism in the Epistemol 
ogy of Testimony,” Mind, civ (1995) 398-41 
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with action and from the importance of the concerns that morality 
addresses. 

Second, moral knowledge is supposed to be prachcal knowledge. It 
is not enough simply to know, for example, that sexism is wrong. 
One must be able to put that knowledge into practice.’ But, insofar 
as one must rely on other people's judgment as to when ‘sexism’ ap- 
plies, one’s knowledge of the wrongness of sexism will be practically 
deficient. Similarly, unless one knows not only that sexism is wrong 
but also why it is wrong, one will not be able to generalize from the 
wrongness of sexism to the wrongness of other sorts of discrimina- 
tion. Because of this, one’s knowledge that sexism is wrong will not 
reverberate throughout one’s actions and will not extend itself to 
other practical spheres. 

Third, borrowing moral knowledge appears to be in tension with 
the ideal of an autonomous moral agent. On some accounts of au- 
tonomy, the tension will be straightforward. If we think of autonomy 
as requiring independence from others, then those who borrow 
moral knowledge must, by that fact alone, fail to be autonomous.’ 
On this picture, being autonomous presupposes what we might call 
“intellectual individualism.”*® Such accounts of autonomy have been 
roundly criticized, for it is hard to see why autonomy, conceived as 
independence, should be something valuable. 

A more plausible account of autonomy views autonomy as involv- 
ing, among other things, the capacity to reflect critically on the prin- 
ciples that one adopts. Through such reflection, one comes to make 
those principles one’s own.” But it is easy to see why this account of 
autonomy, while not as obviously incompatible with borrowing 
moral knowledge as autonomy-as-independence views, might 
nonetheless be in tension with it. If we have borrowed our moral 
knowledge from others and cannot offer reasons in support of it, 
then how can we reflect critically on that knowledge? Moreoever, if 
we do not understand the rationale behind those borrowed princi- 
ples, how can we use our borrowed principles to reflect critically on 
other principles we accept? How do we know the ways in which those 


’ This point u made by G. E. M. Anscombe, in “Authority in Morals,” repnnoted 
in The Collected Philosophical Papers of G. E M Anscombe (Minneapolis Minnesota 
UP, 1981), pp. 4350, here pp. 47-48. 

* For this view, see Wolff and Thomas Scanlon, “A Theory of Freedom of Ex- 
pression,” Philosophy and Pubic Affatrs, 1 (1972): 204-26. For cogent criticisms of it, 
see Dworkin, pp. 22-25. 

* This term is from Sigurdur Kristinsson’s Ph D. dimertanon, The Natures and 
Value of Individual Autonomy (Cornell University, 1996), p. 129. 

'* Dworkin gestures toward such a requirement (op. at, p 20). 
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principles connect with others, or the ways in which the considera- 
tions that support them align or fail to align with the considerations 
that we take to support our other principles? And if we cannot bring 
our borrowed knowledge into play in rationally assessing our other 
moral commitments, then those commitments will not be given the 
critical scrutiny they deserve. 

We began by thinking that since moral views are held for reasons, 
moral argument should be our primary source for moral knowledge. 
Testimony can have a place, though a small one, because we do not 
always have time to examine adequately the reasons in favor of a 
moral view. Now it looks as though it is incumbent on us, as consci 
entious agents, to limit our dependency on the moral testimony of 
others as much as we can. If, in the end, we sometimes still have to 


rely on testimony, that is a regrettable consequence of our finiteness. 
N. AN EXAMPLE THAT REOPENS THE CASE AGAINST MORAL TESTIMONY 


The case against testimony about moral matters appears to be 
strong. It is easy to construct examples that show, however, that the 
case against testimony must be less than fully compelling. Here is 
one such example. 

Peter had lived in a fourteen-person cooperative house for two 
years and was deeply committed to the ideal of cooperative living. 
His first year there, he had felt uneasy about the cooperative’s mem- 
bership decisions, but the second year he found them intolerable. 
Membership decisions were made on the basis of quite lengthy inter- 
views, and decisions to accept someone had to be unanimous. Any 
member could exercise a veto over a potential member for any rea- 
son, though in practice such vetoes were rarely necessary, as support 
for a potential member who might have incurred a veto generally 
evaporated. Peter came to believe that the threat of veto was making 
membership decisions arbitrary and unjust. White women and 
women of color had advocated rejecting three white men on the ba- 
sis of their perceived sexism, and, in one case, racism. Peter had a 
settied and serious commitment to the elimination of racism and 
sexism, but he was not very good at picking out instances of sexism 
and racism. Thus, his commitment often remained theoretical and 
he saw fewer occasions as calling for antisexist or antiracist response 
than he might have seen. Such blindness can sometimes indicate in- 
sincerity, but in Peter’s case it did not. He genuinely did want to un- 
derstand the perspective of the women and he wanted to be able to 
share it, if indeed it turned out to be correct. He could pick out 
egregious instances of sexism and racism, and could sometimes see 
7 “sexist” or “racist” applied to more subtle instances when the 
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reason for their application was explained to him, but he seemed 
bad at working out how to go on to apply the words to nonegregious 
new cases. Problems arose for Peter when he could not grasp the 
reasons why the women were calling someone sexist, when he either 
could not see, or could not see as evidence, the considerations that 
the women thought supported viewing the would-be members as sex- 
ist. Discussion tended to trail off: 


“No, ıt wasn’t anything he said, it was the way he looked at me 
when I asked him a question.” 

“How do you mean?” 

“Well, it wasn’t that he didn’t look at me and he didn’t stare at 
my breasts, it was just a way of looking—a way of not quite being present 
as he answered me.” 

“What way? I do not understand what way you mean.” 

“Well, it was just a feeling I got. A feeling that he didn’t think me 
important.” 

“But you can’t reject someone for a feeling.” 

“I’m not, I’m rejecting him because he’s sexist.” 

Peter decided to leave the co-op. His reason for leaving was that, 
as a member, he felt he had to be able to endorse the decisions that 
the group was making—they had to be decisions that were “his” in 
the sense that he could identify with them, even if they were the re- 
sult of compromise and negotiation. 

The women in the house wanted to say, and did say: “Look, just 
trust us, we know about this.” But to Peter, such trusting was an abdi- 
cation of moral responsibility. Even worse, from Peter’s point of 
view, the decisions about which men to accept were arbitrary and dis- 
criminated against the men rejected. Nor could such discrimination 
be justified on the basis of some goal it might achieve; it was not, for 
instance, a matter of increasing the presence of women and people 
of color in the co-op. It was, from his point of view, prejudice, pure 
and simple. 

I think that in this case Peter should have been willing to accept 
the women’s testimony that these men were sexist. But for him to 
have done so would have been to accept and act upon a moral judg- 
ment on someone else’s say-so. It would have been to borrow moral 
knowledge. 

At this point, it might be objected that the case is misdescribed. 
Peter was not being asked to rely on other people’s moral judgment 
at all. Instead, he was being asked simply to accept that a purely de- 
scriptive term, ‘sexist’, applied. If we are inclined to think it not out- 
rageous that he should have deferred to the women of the house, a 


f 
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is because it is reasonable to suppose that they might be better able 
than he to apply the descriptive terms ‘sexist’ and ‘sexism’. But, the 
objection continues, this is no different from our deferring to biolo- 
gists over when species terms apply, to astronomers over astronomy 
terms, and so on. It is true that ‘sexism’ also carries with it connota- 
tions of disapproval, but we can divide the meaning of the term into 
an expressive or condemnatory component and a purely descriptive 
component. The most popular candidate for the descriptive compo- 
nent would be ‘differentiating between the sexes on the basis of 
their sex and nothing else’. 

This objection is mistaken, however. ‘Sexism’ refers not to just any 
sort of differentiation between the sexes, but to morally unjustifiable 
differentiation. (It is a failure to grasp this that leads some to say that 
any women-only event has to be sexist since it treats men and women 
differently simply on the basis of sex.) Thus, to know how to apply 
the term ‘sexism’ is to have some moral knowledge." 

The objection does point out an important feature of the case, 
however, and one that shows it to be the simplest, and most readily 
defended, sort of case. Peter already had a commitment to ending 
sexism. He was not borrowing from others the very principle “sexism 
is wrong.” That much, let us suppose, he had learned for himself 
(though probably not by himself). If asked, he would have been able 
to give reasons in support of it—he might, for example, have said 
that sexism shows a failure to accord all persons equal respect. Peter 
did not need to borrow his knowledge of the relevant principle. 
What he needed to accept on other people’s say-so was that this prin- 
ciple applied. We can thus distinguish between taking it on trust that 
a principle you independently acknowledge applies in a particular 


u! Sometimes there is a clear answer to the question of whether something is 
moral knowledge or the sort of nonmoral knowledge that feeds into moral knowl 
edge. (For example, average daily calonc intake of the poor may be the sort of fac- 
tor on which the justice of a society supervenes, but it is clearly nonmoral.) But I 
do not thmk this distinction is always easy to draw. For example, should we say that 
the fact that someone will be upeet by a course of action 1 a purely nonevaluative 
fact, though one which may contribute to determining whether a course of action 
is right or wrong? Or should we say it us itself an evaluative conmderation? There 1s 
reason for thinking it 1s evaluative. Being upset contrasts with what 1s colloquially 
called “having a hissy fit,” since the later implies that the distress is prima facie un- 
justified. ‘Upset’ may thus have connotations of prima facie justified (or at any rate 
not prima facie unjushfied) distress. It w worth notung that, ff you want to maintain 
a sharp separation between moral and nonmoral epistemology and allow tesumony 
to the latter but not the former, then you had better be able to draw clear hnes be- 
tween the moral and the nonmoral. A view like my own, that postulates no such 
sharp dichotomy at the level of epistemology, need not worry about being able to 
raw this distinction in every case. 
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case, and taking it on trust that a principle is correct. In between 
these two, we can also identify taking it on trust that the principles 
you recognize on independent grounds are to be ranked in a given 
way.'? (Such rankings are often at issue in setting political agendas.) 
We shall need to consider whether there is reason to be more or less 
suspicious of borrowing moral knowledge depending on what kind 
of knowledge it is that you are borrowing. 

While I think that Peter was mistaken to refuse to accept the wom- 
en’s testimony regarding whether someone was sexist, his refusal 
nonetheless displayed admirable traits of character and admirable 
strength of character. Peter’s action was not motivated by a desire to 
control the outcome of collective decisions. He made the decision to 
leave quietly and it was never intended as a threat to force the group 
to decide in the way he thought correct. The esteem in which Peter 
and the women held each other made it possible to sustain friend- 
ships despite the disagreement and despite Peter’s leaving. It was 
quite probable, too, that others shared Peter’s misgivings, but lacked 
his conscientiousness about moral matters. 

Peter's stance shows us the prima facie plausibility of the three 
considerations against accepting moral testimony: he shows a keen 
and admirable awareness of the importance of moral issues. More- 
over, his desire to have, and to understand, morally adequate reasons 
for acting makes us fairly sure, that, should Peter fall into morally 
bad circumstances, he will not be following the leader into error. 
This sort of independence—the independence of those who want to 
keep their moral beliefs in step with what they take to be cogent rea- 
sons—gives us a sense of the notion of autonomy which shows why 
autonomy is thought to be something valuable. Finally, had Peter ac- 
cepted the women’s testimony, he would still not have been any the 
wiser about how to go on to apply ‘sexism’ to new, nonegregious, 
cases. 

Overall, though, Peter’s case does more to undermine the argu- 
ments against testimony than to support them. In particular, it in- 


= This tripartite disunction does not map neatly onto Gibbard’s distinction be- 
tween fundamental and contextual authority (of. at, pp. 174-81). Contextual au- 
thority is authonty granted to another on the presupposition of shared norms 
Gibbard’s examples, however, suggest that he may have in mind cases in which the 
borrower is ignorant about some nonmoral facts of the situation and so lets some- 
one else's moral reasoning stand “proxy” (op. at, p. 174) for her own. A moral 
agent claims fundamental authority when she claims the superior trustworthiness 
of her own norms. But it seems that one can accord another some fundamental aw 
thority without accepting their norms on the basis of her say-s0, as when one takes 
the fact that someone believes a norm as a reason for accepting it, but as a reason 
that is, in the circumstances, outweighed (op. at, p. 180). N 
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vites us to reexamine the initial assumption that led us to believe 
that testimony must, for the morally mature, always be an inferior 
way of learning about morality relative to learning about it through 
argument. We are not supposed to need testimony, except when 
time is short, because someone who accepts a moral position can al 
ways make the grounds of her beliefs accessible to us, and we can 
come to share her reasons or not, depending on whether we find 
them persuasive. 

Reflection on Peter's case suggests two compatible responses to 
this argument. We might say that at least sometimes, the capacities 
involved in acquiring moral knowledge are indeed similar to percep- 
tual capacities. There is a moral analogue of blindness, which is not 
to be assimilated to being too foolish to follow a moral argument. 
Those who lack the relevant perceptual skills in a given moral do- 
main must rely on those who have sharper moral vision. 

Alternatively, we might point out that the contrast between testify- 
ing and arguing is not hard and fast. Expert witnesses both tesufy 
and argue. They offer arguments when presenting the considera- 
tions that lead them to adopt their views, but they testify, too, since 
the nonexpert to whom the testimony is directed is not in a position 
to assess fully the cogency of the reasons the experts offer in favor of 
their views. The nonexpert may not know exactly why a certain re- 
sponse on a given test indicates with such-and-such probability that 
the blood sample comes from the defendant, but will, having estab- 
lished the expert's credibility, take her word that it does. Offering ar- 
guments for a position thus need not entirely eliminate the reliance 
on the word of the testifier which is characteristic of offering testi- 
mony. Expert testimony shows the context sensitivity of the notion of 
testimony—directed at fellow experts, the arguments that for the lay 
person are also testimony would be just arguments. 

I think that both responses have a place and are not as problematic 
as they might first appear. In defense of the claim that moral knowl 
edge sometimes requires skills analogous to perception, we need only 
think of those occasions in which seeing that some moral value is at 
stake requires an emotional sensitivity to the particulars of a situation. 
For example, seeing that a situation calls for kindness may require an 
empathetic response involving imagination of what it must be like to 
be in someone else’s shoes. But we might not all be equally capable of 
such empathetic leaps—some of us may need to rely on those who are. 

Sometimes, those from whom we accept moral testimony stand to 
us, regarding certain issues, as moral experts. But the very idea of 
moral experts might seem deeply problematic. It seems profoundly 
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inegalitarian to claim that some are moral experts, since it suggests 
that others are not. While the higher branches of mathematics 
might be beyond some of us regardless of our best attempts, and 
available to others only if they make certain attempts, it is often 
thought that each of us is equally capable of being a good moral 
agent. And there can be no experts where capacity is equal.'* More- 
over, an induction on the past history of those who have claimed to 
be moral experts and with what results might make us skeptical of 
the very idea of moral experts. Many who have claimed that role 
have used it to advance self-serving and morally dubious views. 

These concerns about moral experts need not be taken as con- 
cerns about the very idea of moral expertise. They are for the most 
part generated by worries about establishing a socially sanctioned 
role of moral expert. Having a socially sanctioned role of moral ex- 
pert is going to raise issues of credentialing—Who are the moral ex- 
perts and who gets to decide?—and issues of what authority those in 
the socially sanctioned role of moral expert are to have over policy 
formation, and over the rest of us who may disagree with their judg- 
ments. These issues, however, do not arise when considering the 
question of what sorts of circumstances, if any, are ones in which an 
agent should decide to rely on someone else's moral judgment 

We can further reduce the sting of acknowledging that there can 
be expert testimony on moral matters by distinguishing between ex- 
peris and expertises, * and between global and local claims to expertise. 
While we think that there can be few experts on any subject—if your 
knowledge is not rare and hard to come by, you would not merit the 
title “expert"—expertise can be widely shared. Normal adults have 
considerable expertise at a wide variety of things. To borrow a list 


% See Arthur Caplan, “Moral Experts and Moral Expertise: Does Either Enst?” 
reprinted in his If] Were a Rich Man Could I Buy a Pancreas? (Bloomington: Indiana 
UP, 1992), pp. 18-39. 

4 This thought les behind the strong suspicion that if, for example, Lawrence 
Kohlberg’s scale of moral development consistently scores women, the working 
class, and members of other cultures at a lower level of moral development than 
educated middle-class white males, this must indicate some bias in the scale. The 
suspicion of bras is not removed by the fact that when education and type of em- 
ployment are controlled for, men and women score the same. In contrast, we 
would not infer bias ın a scale that aimed to measure mathemancal skill if it led to 
similar results. We would simply say that those with certain educational back- 

unds and occupations were, most likely, better mathemancians than those who 
these backgrounds. While it might be plausible that having a certain back- 
ground exposes you to certain types of moral problems, however, everyone, we 
think, has exposure to moral problems of some type or other, and thus everyone 
will have the sort of experience necessary to develop their potential as moral 
agents, though this development may take different forms. 
* For this disunction, see Caplan. 
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from Arthur Caplan, “how to drive a car, mail a letter, exhibit good 
manners, use a post office, play games” (op. cit, pp. 30-31), and, we 
might add, considerable expertise in ethical matters. Thus, in saying 
that someone can sometimes stand to another much as expert witness 
to lay person, we are not committed to saying that the person who 
currently does not have the expertise could not acquire it, though in 
some cases the kinds of character change and changes in experience 
necessary in order for them to acquire it may be sufficiently radical 
that, for all practical purposes, they are unable to do so. More impor- 
tant, we are not committed to saying that one person stands to an- 
other as expert to lay person in all areas of value. If that were indeed 
the case, it would be tempting to say that the one forced to rely on 
the other would lack moral maturity, so we would be talking not 
about a case in which the morally mature agent needed to borrow 
moral knowledge but about a case in which the agent was still a be- 
ginner. 

Although lowering the cost of the analogy between expert testi- 
mony and testimony regarding morality, none of what has been said 
so far gives us any reason to suppose that moral expertise might be 
unevenly distributed with respect to certain domains."* But evidence 
for this claim is not hard to find. Examples like that of Peter provide 
us with the sort of evidence we need. Nor does positing such differ- 
ences in likely familiarity and knowledge commit us to any epistemo- 
logically radical claims. Baier (op. cit, p. 247) and others argue that 
one’s moral experience shapes the kinds of questions one will find 
salient: had moral theorists begun from the experience of women, 
they would have paid more attention to trust, since trust is of vital 
importance in relations among unequals. 

Political engagement in liberation movements can also be expected 
to lead to greater insight and understanding about the values at stake 
than is likely to be available to someone who lacked that sort of experi- 
ence. One way to find out about the value of respect is to ask those who 
have always been respected; a better way is to ask those who have strug- 
gled to win respect and tried to live without it. Saying that such political 
experience can contribute to moral knowledge commits us to nothing 
stronger than the view that one’s knowledge about moral matters, like 
one’s knowledge about other matters, can deepen with experience. In 
particular, the suggestion that, for example, women actively engaged in 
feminist movement, might have acquired some expertise about the 


* In fact, this would have to be concerning certain areas only, or we would run 
afoul of the idea that, globally, there will not be much difference in moral skill be- 
tween people in distinct groups. 
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kind of disvalue sexism is, and the often subtle forms it can take, does 
not require us to defend a notion of epistemic privilege of the kind de- 
fended by standpoint theorists.” It is enough if, through social location 
or choice, individuals come to have richer experiences with certain 
types of moral problems, We need not follow standpoint theorists and 
suppose in addition that the marginalized are always, or even generally, 
in the best position to understand the truth about social relations.” 

If we allow that capacities akin to perceptual capacities are sometimes 
involved in generating moral knowledge—especially the knowledge of 
when values are at stake in particular situations—and tf we admit that 
there can be local differences in moral expertise, differences explained 
by different experiences, then we have come some way toward showing 
that there can be room for moral testimony. Sometimes, we do need to 
hear, and to believe, reports of explorers returning from realms of value 
that are less familiar to us than to them. Nonetheless, one might com- 
tinue to be uneasy about moral testimony: What of the objections that 
borrowing moral knowledge underestimates the importance of moral 
matters, overlooks their practicality, and conflicts with autonomy? The 
first of these objections is best dealt with now, while the second and 
third need to wait until we have a clearer understanding of when an 
agent is justified in relying on moral testimony. 

We need not suppose that we have to meet the first objection on 
exactly its own ground. There is a good case to be made for just 
denying that there is any difference between moral and other knowl 
edge with respect to the way it engages our responsibility: we are 
every bit as accountable for believing the racist science of The Bell 
Curve”? as we are for having false moral beliefs. But the strongest case 
against this position can be made by affirming its premise. If it is so 


t For this use of ‘feminist movement’ rather than ‘the feminist movement’, see 
bell hooks, Fem:nzst Theory from Margin to Center (Boston’ South End, 1984). 

$ For a discussion of standpoint theory, see Nancy Hartsock, “The Feminist 
Standpoint: Developing the Ground for a Specifically Feminist Historical Material 
ism,” in Sandra Harding and Merrill B. Hintikka, eds., Discovenng Reahty. Femrnist 
Perspectroes on Ejastemology, Methodology, and Philosophy of Saence (Dor- 
drecht: Reidel, 1983), pp. 283-310. It would be a mistake to think that polincal en- 
gagement must count against objectivity. Even if such a anos means that one 
will approach questions with presuppomtons, provided one’s presuppositions 
are true (or approximately true), one can count as being objective. Note that un 
leas this were the case, the theory dependence of scientific method would count 
against the objectivity of science. For this notion of objectivity, see Richard Boyd, 
“How to Be Moral Reahbst,” in G. Sayre-McCord, ed., Essays on Moral Realism (Ithaca: 
Cornell, 1988), pp. 181-228; and Louse Antony, “Quine as Feminist The Radical 
Import of Naturalized Epistemology,” in Antony and Charlotte Witt, eds., A Mind of 
One’s Own (Boulder: Westview, 1993), pp. 185-225. 

* Richard J. Herrnstein and Charles Murray, The Bell Curoe. Intelhgence and Class 
Structure m American Life (New York: Free Press, 1994). 
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very bad to make a moral mistake, then it would take astonishing ar- 
rogance to suppose that this supports a do-it-yourself approach, espe- 
cially once we think about how expertise can vary with experience. 
Of course, the worry behind this objection might be that people will 
too readily rest content with borrowing the moral beliefs of others 
without any grounds for supposing them likely to be correct. And 
that, the objection goes, would be a failure to take moral matters as 
seriously as they should be taken. This assumes, however, that the 
borrower of moral knowledge has few responsibilities to meet before 
she can count as borrowing wisely. In what follows, I shall argue that 
the standards for wise borrowing are stringent indeed and so offer 


no comfort to the morally indolent. 
MI WISE TRUSTING 


I shall argue that the problem of how to be a wise receiver of moral 
testimony is a special case of the problem of how to be a wise re- 
ceiver of testimony more generally, and this in turn is a special case 
of the problem of how to be wise in one’s trust. Because of this, theo- 
ries of trust, and in particular accounts of trust’s justification condi 
tions, can offer insight into some of the epistemological issues raised 
by testimony. But first it is necessary to show that relying on test+ 
mony does, in the typical case, require trust in the testifier. 

It might seem simply obvious that testimony involves trust: where 
there is testimony there is trust, or so says common sense. The lan- 
guage in which we describe cases of testimony supports this claim, 
for we speak of “taking it on trust,” and of “trusting testimony.” 
While I shall argue that common sense is basically right here, it does 
take some additional argument to vindicate it—unless, that is, we 
have a minimalist account of trust. 

So as not to mislead the reader into thinking the connection be- 
tween trust and testimony follows just as a matter of course, I shall 
temporarily abstain from using phrases like ‘trusting testimony’ or 
‘taking it on trust’, and use instead the phrase ‘accepting testimony’. 
Someone accepts the testimony of another if she forms the belief 
that p on the basis of the other’s saying that p is the case. We can use 
testimony that we do not accept: for example, the movie critic whose 
rave tells us that we would hate the movie, or the suspect whose an- 
swer signals that she has something to hide. The thesis that needs to 
be defended can be put like this: in the typical case, accepting testi- 
mony requires trust in the testifier and, as often as not, in others as 
well. 

How plausible one finds the claim that there is a connection be- 
tween trust and accepting testimony depends, not surprisingly, on 


68 THE JOURNAL OF PHILOSOPHY 


the account of trust one accepts. If you think that trust is just a mat- 
ter of relying on others, then it is obvious that accepting testimony 
has got something to do with trust. Call accounts of this kind 
risk-assessment accounis. Diego Gambetta™ offers us an example of a 
risk-assessment account: 


. trust (or, symmetrically, distrust) is a certain level of subjective proba 
bility with which an agent assesses that another agent or group of agents 
will perform a particular action, both before he can monitor such action 
(or mdependently of his capacity ever to be able to monitor it) andin a 
context in which it affects Ais own action (thd, p. 217). 


On a risk-assessment account, it follows straightforwardly that accept- 
ing testimony requires trust: I would not accept your testimony un- 
less I assigned a sufficiently high degree of probability to your telling 
me the truth in these circumstances. 

Risk-assessment accounts do not distinguish among the reasons 
one might assign a sufficiently high degree of probability to an agent’s 
performing an action. One might think an agent will perform that 
action out of fear or stupidity, because it coincides with her own self- 
interest, or because she has, and wishes to display, goodwill toward 
those who are counting on her. 

Willbased accounts, in contrast, find trust only where there is re- 
liance on the goodwill of another. We are not to take ‘goodwill’ as syn- 
onymous with ‘personal friendly feeling’, or willbased accounts would 
be obviously too narrow. Sometimes those we trust have the relevant 
goodwill just in virtue of being morally decent, or honest, or caring 
about fulfilling their duty. Since wilbbased accounts are more restrictive 
than risk-assessment accounts, if J can show that even on such accounts 
trust is needed, then I shall have shown that whatever account of trust is 
ultimately endorsed, accepting testimony requires trust. Consider the 
following willbased account of trust according to which trust is: 


(a) an attitude of optimism that the goodwill and competence of an- 
other will extend to cover the domain of your interaction, together 
with 

(b) the confident expectation that the one trusted will be directly 
and favorably moved by the thought that you are counting on her.® 


There are clearly some nonstandard cases in which I can accept tes- 
tiimony without having an attitude of optimism that the goodwill of 


» “Can We Trust Trust?” in Gambetta, ed., Trust: Making and Breaking Cooperative 
Relations (New York: Blackwell, 1988), pp. 213-38. 

" For a defense of this account, see my “Trust as an Affective Attitude,” Exhscs, 
cvi (1996): 4-25. Baier defends a will-based account (op. at). 
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another will extend to cover the domain of our interaction. I may, for 
example, have evidence that you would not dare lie because I have a 
threat advantage, and know that you are someone who is susceptible 
to threats. In a different scenario, I might know that it will somehow 
suit your devious purposes to tell me the truth, here and now 
(though in general you would not). Usually, however, we are in ner 
ther position: we have no threat advantage sufficient to guarantee 
performance without also relying on goodwill, and we do not have 
the kind of information that would let us know when those who are 
generally untrustworthy are in fact telling the truth. The false witness, 
like the lying promise giver, is threatened with the possibility of never 
again having her testimony believed. But this threat is not sufficient 
to guarantee performance in such a way as to eliminate the need for 
reliance on goodwill, and to be effective at all it requires both that we 
be able to tell when we have been deceived and that we will again en- 
counter the false witness.” These conditions are frequently not met. 

What about the need for optimism about the competence of an- 
other? In accepting testimony, we typically lack the kind of proof 
about the competence of another that would securely ground our 
optimism in her competence. Indeed, certain kinds of proof are 
ruled out. [f testimony is to be at all useful, I cannot myself go and 
check up on what you are saying. Sometimes, my not doing so is 
merely a matter of convenience: I accept testimony precisely because 
it would be too bothersome to learn for myself what I can learn from 
you. But where testimony has the potential to have rich epistemic 
yield, I am not in a position to do the kind of checking up on your 
judgment which would be required. I may be ignorant about, for ex- 
ample, some mathematical theorem, and lack the expertise to find 
out the truth of the matter on my own. Even if I attempted to check 
up on your word by trying to prove it on my own, I, novice that I am, 
shall often have fewer grounds for confidence in my own judgment 
than I have for confidence in yours.* 

At this point, it might be objected that I have failed to distinguish 
two kinds of checking up: I may not be able to check up on the con- 
tent of what you say, but I may be able to check up on you” We often 
do call on other people’s assessment of our informant’s character 


™ For this penalty in the case of the false promisor, see David Hume, A Treatse of 
Human Nature, P. H. Nidditch, ed. (New York: Oxford, 1978, 2d ed.), p. 522. 

* For this response to the objection that trust is not needed ın the scientific com 
text because game-theoretic considerations alone will be sufficient to guarantee 
honesty, see Hardwig, p. 704 

" Hardwig makes this point (of at, pp. 698-99). 

= These are distinguished by Dworkin (op. at, p 56). 
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and expertise before accepting their testimony. But this move does 
not eliminate trust; rather, it grounds my trust in you in my trust in 
others. It may disperse trust, but it does not make it redundant. 

To show that it is true on the account of trust in question that ac- 
cepting testimony typically requires trust in the testifier, we also need 
to show that such acceptance will require the confident expectation 
that the one trusted will be directly and favorably moved by the 
thought that one who trusts is counting on them. It is easy to show 
that this is also required: conversational exchanges do not come la- 
beled, conveniently announcing themselves as the kinds of ex- 
changes that they are. Before I can accept what you say in response 
to my request or need for information, I must first assume that what 
you are doing is mn fact responding to my request or need—that you 
are not, for example, teasing me by saying something silly since you 
think a silly question deserves a silly answer. 

The common-sense assumption that accepting testimony involves 
trust is borne out, if not for every possible case, then for the standard 
cases and these form the vast majority of cases of accepting testi- 
mony. 

If this is right, then the question of what it is to be a wise and re- 
sponsible recipient of moral testimony just will be the question of 
what it takes to trust wisely in these domains. And that is a nice re- 
sult, because we know something about the justification conditions 
of trust. 

In Testimony, C. A. J. Coady tries to argue that our default stance 
should be one of trust in testimony: we require some evidence to 
move away from this stance and none to adopt it. Viewed from the 
perspective of accounts of when trust is justified, this claim invites 
suspicion. It is simply impossible to say what our default stance with 
respect to trust in general should be. For this reason, we should be 
surprised indeed to find that one could defend a default stance of 
trust with respect to testimony, given that testimony can cover a wide 
range of areas, be given by a wide range of testifiers, and occur in 
different social climates and different institutional settings.” 


™ Fricker notes the context sensitivity of appropriate trust in testimony (op. at). 
She accuses Coady of an inconsistency in saying that our default stance should be 
one of trust and at the same tme allowing that we accept testimony only when 
ngns of untrustworthiness are absent (op. at, p. 405). But this is sull to have a de- 
fault stance of trust. So long as I do not need positrve evidence of trustworthiness 
before I trust, then my default stance is one of trust In what follows, I argue that 
sometimes our default stance can be distrust, thus my position may be more skept+ 
cal than Fricker’s. 
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The reason there are no useful generalizations to be made con- 
cerning whether trust, distrust, or neutrality is the appropriate de- 
fault stance is that the appropriate default stance is sensitive to 
climate, to domain, and, linked to domain, to consequences. Fur- 
ther, for the individual truster, the appropriate default stance is 
linked to the reasonable assessment of the tendencies of her own 
trusting and distrusting.?’ 

In climates in which the there is strong motive to be untrustwor- 
thy, it would require more evidence to believe that a person was 
trustworthy than it would take to believe that a person is trustworthy 
in climates where there is little incentive to untrustworthiness. Thus 
in unfavorable climates, our default stance should be distrust, and it 
should take considerable evidence to move from that position to 
trusting; in more favorable climates, however, a default stance of 
trust can sometimes be warranted. 

Domain also affects which default stance is appropriate: if I am to 
trust you with a secret of mine, then I shall want to have quite a bit of 
evidence about your character. If, though, I am to trust you not to at- 
tack me in the street, I may need no particular evidence about your 
character at all. This might seem to be a counterintuitive result, 
since surely it is worse to be attacked in the street than it is to be em 
barrassed by a confidence indiscreetly betrayed. Domain is generally 
more important than the consequences of misplaced trust, however, 
because domain signals likelihood of performance. We are all aware 
of the lively attractions of gossip and the strength of character re- 
quired to resist them. In contrast, it is not hard to refrain from injur- 
ing a stranger on the street; that just takes basic decency, a trait that 
we can assume is widely shared, unless the climate is sufficiently bad. 
Once we hold domain fixed, consequences become of the first im- 
portance: of course, | am going to need more evidence of your trust- 
worthiness before I tell you a secret that, if spread abroad, would be 
very damaging to me, than before I tell you a secret whose disclosure 
would be merely embarrassing.™ 


r These conditions for justified trusting are further elaborated and defended in 
my “Trust as an Affective Atntude,” especially at pp. 20-25. 

" Hardwig wonders about the extent to which the truster of scientific testimony 
needs to have good evidence for her belief that the inquirer ıs trustworthy (op. at, 
p. 699). It seems to me that his account would be more plaumble if he were to take 
differences in domain into account. Scientific inquiry has an historically poor track 
record concerning poliucally senmtive topics, such as race and gender. Whatever 
we are to say about the kinds of evidence needed before accepting the research of 
other scientists in general, ıt seems we need extra caution when it comes to re- 
search that can validate social relations of dominance and subordinanon 
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While climate, domain, and consequences are variables determin- 
ing which default stance is justified that extend across agents, the 
fourth variable determining the appropriate default stance is agent 
specific. Some agents have reason to be distrustful of their tenden- 
cies toward trust in certain domains. Others have reason to be dis- 
trustful of their tendencies toward distrust. Here we might think of, 
for example, racist tendencies to distrust young African-American 
men. Such agents should be suspicious of their own default stances, 
and that, in turn, should lead them to be more willing to abandon 
them than those who can trust their trust and distrust. 

If climate, domain, consequences, and metastances of trust or dis 
trust are the important variables determining how much evidence is 
warranted before trusting, we can see why there can be no answer as 
to the appropriate stance to take toward testimony in general. Most 
people can be trusted to give an honest and informed answer to cer- 
tain questions, including the mundane sorts of questions we often 
rely on strangers to answer for us, such as what time it is, how we get 
to the bus stop, and so on.” But when it comes to matters where 
there are reasons for not being straightforward—for example, social- 
science surveys of sexual activities—or to matters where competence 
cannot be assumed, we should not adopt a default stance of trust in 
testimony. 

This general discussion has implications for what we should say 
about how much and what sort of evidence we should demand be- 
fore trusting someone’s moral testimony. Moral knowledge regard- 
ing the sorts of matters for which we might need moral testimony 
cannot be easy to come by. If it were easy to come by, then we would 
expect it to be in the commonly shared stock of moral expertise. In 
addition, our moral thinking is vulnerable to distortion due to, for 
example, wish fulfillment and selfinterest. Having such knowledge 
requires a good character, as well as the right sorts of experiences. 
Since the best way to convince others to go along with your interests 
is to convince them that morality requires them to do so, we can ex- 
pect tendencies toward untrustworthiness—perhaps deliberate, per- 
haps as the result of self-deception. 

If this is right, then the appropriate default stance toward testifiers 
about morality is one of distrust. A default stance of distrust gives a 
presumption against accepting moral testimony, where this pre- 


* But even this list is climate sensitive. I am told that in Egypt it is thought digre- 
spectful not even to attempt to offer directions to those who ask for them. Rather 
than be disrespectful, an informant will send you in completely the wrong direc- 


tion. 
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sumption can take the form of either disbelieving such testimony, or 
withholding judgment. What distinguishes a default stance of dis- 
trust from a stance of neutrality is that a substantial weight of evi- 
dence is required to rebut the presumption; with neutrality, there is 
no presumption that stands in need of rebutting. 

In discussing Peter’s case, I conceded that it was the simplest sort 
of case, since it involved relying on someone else's judgment that a 
principle he already accepted applied. If your source is wrong about 
the relative importance of a good, or wrong about which principles 
or ends to adopt, end you take her word for it, your error is going to 
surface not just in one place but in many; if, however, you rely on 
someone’s judgment that a principle you accept on independent 
grounds is at stake in a given situation and she is wrong, at least the 
error you inherit is contained. The intuition that these kinds of cases 
are importantly different is shown to be just a special instance of the 
way in which consequences interact with domain to affect the justi- 
fiedness of trust. 

But acknowledging that the appropriate default stance toward tes- 
tifiers on moral matters is distrust does not mean that we can never 
be licensed in moving from distrust to a position of trust. Before we 
do so, we shall want to have good evidence about the person’s char- 
acter, about possible hidden agendas, and about whether she has the 
sort of experiences that contribute to the kind of competence we are 
counting on her to have. We would also want to know that our wit- 
nesses have appropriate epistemic self-assessment,™ and are not 
given to asserting with confidence claims that they are in a position 
to assert only tentatively if at all. Perhaps the most trustworthy testi 
fiers about moral matters are those who are least inclined to offer 
such testimony. 

Returning to Peter’s case, we can see, I think, that had Peter ac- 
cepted the women’s testimony, his trust would have been justified. 
Having lived in the same household, Peter had available enough of 
the right kind of information to justify moving from a default posi- 
tion of distrust to one of trust. Both sides thought well of each otb- 
er’s character. . 

In addition, Peter had reasons to distrust his own distrust. It might 
be objected that Peter could not know this in advance of knowing 
whether the women were in fact correct. But if Peter could have 
known whether the women were correct, then there would have 
been no need to trust them in the first place. This objection forces 


™ For the notion of epistemic self-assessment, see Hardwig, p. 700. 
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us to consider whether the notion of appropriate metastance can 
pull enough independent weight to enter into our assessment of 
when trust is justified, or whether our assessment of appropriate 
metastance must always be consequent on our assessment of appro- 
priate first-order trust. 

I think, though, that the example itself shows why this objection is 
mistaken. Peter had plenty of evidence to make him suspicious of his 
distrust. His past track record reveals a tendency to find fewer occa- 
sions calling for antiracist and antisexist response than others. More- 
over, given what Peter knew, or should have known, about how 
experience affects competence in these matters, he should have 
been aware that others might be better placed to judge than himself. 

In the example, I assumed the women were in broad agreement 
about whether there was evidence of attitudinal sexism. Had there 
not been such agreement, though, it is still possible that Peter would 
have been justified in trusting the women who found evidence of 
sexism. Agreement is by no means a sure indicator of truth. Some- 
times it counts against the trustworthiness of testifiers, signaling that 
they are in cahoots, rather than independently correct’! And some- 
times, especially in political contexts, it counts against the trustwor- 
thiness of testifiers by signaling that some voices have yet to be 
heard. In fact, given the complex interests that are so crudely cap- 
tured within identity politics, unanimity among the members of an 
oppressed group is generally counterevidence for trustworthiness. 
One of the chief responsibilities of wise trusters will thus be deter- 
mining whom to trust when there is disagreement. 

The evidential standards that must be met before trust in someone 
else’s moral testimony is justified are high. This means that the 
morally indolent can take no particular comfort from a defense of 
the possibility of borrowing other people’s moral knowledge. There 
is still work to be done and responsibilities to be met before we trust 
testifiers on moral matters wisely. But we can see now how to re- 
spond to one of our outstanding worries, namely, the concern that 
accepting moral testimony conflicts with autonomy. We must subject 
judgments about whom we trust on these matters to rational 
scrutiny, and we can be called on to revise our assessments of who is 
trustworthy. Accepting moral testimony is shown not to be in conflict 
with the value of autonomy, properly understood. 

We might still wonder, though, whether, had Peter relented and 
trusted the testimony of the women, there would have been some- 


n See Coady, p. 192. 
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thing inferior about his epistemic status. There are two questions 
here: Could Peter have actually come to know through trusting? Is 
his knowledge deficient as practical knowledge? 

While I think that wise trusters can have knowledge on the basis of 
testimony—provided, that is, they are lucky enough to trust trustwor- 
thy knowers—lI am going to leave that issue to one side so as to take 
up the issue of practicality. Even if Peter had been willing to bor- 
row the women’s moral knowledge, at the end of the day he would 
have been as much in the dark as to how to go about applying ‘sex- 
ism’ to new nonegregious cases as he ever was. Of course, we can 
hope that with discussion and practice he will increase his sensitivity. 
There is, though, no guarantee that this will happen. Borrowing may 
be the best that Peter can ever do. 

So does this not show that there is something regrettable about 
the need to borrow moral knowledge? At most, it shows there are 
some cases in which it is regrettable. Consider now a case of a differ- 
ent sort, concerning ranking principles or ends. The Chinese author 
Luxun once quipped that the Chinese do not understand freedom, 
they only understand cooking oil. Let us start from this quip and 
construct a case. It is China in the early twentieth century and two 
long-time friends are discussing whether to join a political move- 


™ Briefly, the argument for this claim is as follows: in accepting your testimony, I 
am attempting to borrow your knowledge, that is, to clam the knowledge you have 
as my own on the basis of your having shared it with me. But I cannot borrow from 
you what you do not have. Thus, before my accepting your testimony can be knowl 
edge conferring, you yourself must have knowledge regarding the queston at 1%- 
sue. If you are wrong, and I take your word for it, I obviously do not know it; 
likewise, if you are night, but only accidentally right, my being right is no more se- 
curely tethered to the truth of the matter than your being right is. 

But while your knowing that p is a necessary condition for my knowing that p on 
the basis of your say-so, it u not a suffiaent condition. Even if you know that p, you 
might not be trustworthy with respect to this exchange of information. For exam- 
ple, you might be tellimg me that p because you think that 1s a good way to have me 
come to believe that not-p, This would be true, if, for example, you falsely believed 
that I was acutely suspicious of you and so would take your word to be counterevr 
dence for the truth of p. To the extent that my acceptance of testimony rests on 
trust (and not, for example, on having access to the roundabout reason you hap- 
pened to tell me the truth), you must be trustworthy in the relevant domain before I 
can know that p on the basis of your telling me that p If you are not trustworthy, 
then you could just as well be me a falsehood, and I would be in no position 
to discern this; thus my connection to the truth of p, arrived at through your say-so, 
would be too tenuous to count as giving me knowledge that p, even if you yourself 
knew that p 

None of this shows that those who trust wisely, and are fortunate enough to trust 
trustworthy knowers, can thereby have knowledge But, as Coady has pe 
argued, our knowledge 1s so testimony saturated that, if the wise truster cannot 
know, then there is little indeed that anyone can know. 
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ment for democratic freedom. One has returned from study in Ev- 
rope and has himself joined and is trying to persuade the other to 
do so. His friend is reluctant: “It is dangerous,” he says. “Don’t we 
have responsibilities to maintain certain traditions? And isn’t the 
very idea of such change altogether too European?” The one who 
has returned from Europe tries to convince his friend that he does 
not know what freedom is really like and that, if he did, he, too, 
would consider the risk worth it and the tradition stifling. “You 
haven't,” he says, “tasted freedom, and if you had, you would choose 
as I do.” They talk, we might suppose, into the small hours of many 
nights. Finally, the other also decides to join the political movement, 
because of his trust in his friend. But that choice does not leave 
everything as it was before. Having made the choice, he is thrown 
into different circumstances and comes to see things differently. 
Through pursuing democratic freedom, he comes to understand 
more fully its value. He now knows first-hand what he initially knew 
only second-hand, and his belief, originally grounded in testimony, 
comes to have rich alternative sources of support. 

It might be objected that the newcomer to the political movement 
did not need to trust his friend at all and that he was not in fact de- 
ferring to his friend’s moral expertise—he was just trying it out and 
there is no deference involved in trying something out. But this re- 
construction of the example overlooks two important features of it. 
First, the newcomer initially had objections to joining the move- 
ment, objections that had to be overcome before he could see him- 
self as justified in even merely “trying” it out. Second, without at least 
some trust in the friend, it would not have been responsible for the 
newcomer to put himself in circumstances which would predictably 
lead to changes in his experiences and values, changes which from 
his current perspective look morally suspicious. It is no comfort to 
know that once we make a choice, we shall come to have experiences 
that confirm that choice as the right choice, unless we also believe 
that such confirmation will be truth conducive. Joining a cult might 
lead us to have experiences that will confirm our decision—but that 
we shall have such experiences is no reason to join; rather, it is rea- 
son to steer well clear, least our judgment come to be distorted by 
our membership. The newcomer thus had to trust the friend that 
membership in the political group would generate experiences that 
were truth conducive rather than error confirming. 

At this point, it might be thought that my defense of testimony re- 


mains grudging. It is better when knowledge initially gained through 
testimony is fully integrated into a new evaluative outlook and the 
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person comes to know first-hand what she originally knew only sec- 
ond-hand. Someone might accept this and yet still maintain that we 
have an obligation to cultivate all our moral capacities and so use tes- 
timony concerning moral matters only as a last resort when there is 
no other avenue open to us to secure the knowledge we need. Is it 
not part of living a morally serious life that we keep pushing at the 
boundaries of that which we must take on trust until the place for 
testimony shrinks to near vanishing? 

The suggestion that we have an obligation to keep pushing back 
the place for testimony rests on some empirical assumptions, how 
ever, that we should hold doubtful: in particular, it assumes that we 
can develop our moral capacities to an equal extent. Given the world 
of value is complex and the capacities needed to navigate in it vari- 
ous, there is reason to suppose that we cannot develop all morally 
relevant sensitivities to an equal extent. And we may do worse by try 
ing than by practicing wise trusting. 

These remarks suggest an interesting answer to the question of the 
unity of the virtues. I appear to be assuming that you can have one 
virtue without having another. Owen Flanagan™ argues that psycho- 
logical research shows the capacities that support the particular 
virtues do not come in a unified package; thus one can be coura- 
geous without being kind, and we should abandon the thesis of the 
unity of virtue. If one feels the pull of these empirical considere 
tions, yet also feels the pull of a Socratic a priori argument for the 
unity of the virtues, then hope for reconciliation lies in the notion of 
wise trusting. Recall the argument for the unity of the virtues goes as 
follows: virtues are states that issue in only right action. Justice and 
kindness can appear to conflict: if one’s kindness is not to issue in 
wrong action, one must also possess the virtue of justice (kindness 
that leads to injustice is no real kindness). If we think of a virtue as a 
sensitivity to considerations of a certain kind, then given that there 
are no limits on the combinations of such considerations that can be 
present in a choice situation, such apparent conflict between the 
virtues is always a possibility.” Thus, if the virtues are to result in only 
right actions, one cannot have one without having them all. 

The notion of wise trusting lets us accept the empirical claims typi- 
cally taken to support the disunity of the virtues, while retaining a 
kind of unity to the virtues. While the virtuous person might not her- 


” Varetes of Moral Personality’ Ethics and Psychological Reahsm (Cambndge: Har- 
vard, 1991). 

* For this formulation of the Socratic argument, see John McDowell, “Virtue 
and Reason,” TAs Moni, LXU (1979): 331-50, expecially pp. 331-83 
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self have all the capacities necessary for her virtue to lead to only 
right action, she will have appropriate self-assessment and appreciate 
the limits of her capacities. She will know when she needs the moral 
help of others. Moreover, she will be wise in whom to trust. Her ca 
pacities, supplemented by the capacity for wise trust, will issue in 
only right action. 

If these reflections are along the right lines, then we can see an- 
other sense in which the virtuous person’s virtue is essentially social: 
it is not just that the social sphere provides much of the domain for 
the exercise of virtue, nor just that it provides the conditions for $us- 
taining it; rather, the knowledge that the virtuous person has ‘is 
knowledge made available to her in part through epistemic depen- 
dence on others. 

KAREN JONES 
Cornell University 
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RESTITUTION AND REVENGE* 


hy punish wrongdoers? I have never found standard ret- 
W tes accounts of punishment to be able to provide a 

satisfactory answer. While the intuition that those who do 
evil deserve to be punished is not incoherent, it just seems to supply 
insufficient reason to harm wrongdoers.' Retribution does not seem 
to do much good, if any, for either the direct victim, society at large, 
or the criminal. I suggest that we instead try to answer the original 
question by understanding punishment as a debt the criminal pays 
to his,victim(s) as compensation. This can obviously explain what 
good punishment does the victim of the crime. The payment of a 
debt can also benefit the criminal insomuch as it is a form of atone- 
ment. It can be so considered, for the payment provides the criminal 
an opportunity to make amends for his misconduct and demonstrate 
his remorse. It can also reduce the pangs of his conscience. The ap- 
peal of the atonement/debt approach, however, is not only that it 
can remove the guilt of the convict in his own eyes, but that it can 
create in the mind of the public the belief that he deserves to be for- 
given or, at least, granted a fresh start with a clean slate since he has 
made good on his debt. Since this form of punishment can benefit 
the guilt-stricken wrongdoer, it becomes understandable why it 


* might be said that the criminal has a right to be punished and would 


even choose on some occasions to exercise it. 

An additional advantage of this atonement/debt theory is that it 
actually provides a place for vindictiveness in our judicial system. 
The debt the criminal owes his victim can be paid when the latter 


* I would like to thank Frances Myrna Kamm for a number of belpful conversations 
on the topic of punishment 

! Jam not the only one puzzled about how the retributyist idea by itself can pro- 
vide sufficient grounds for harming criminals. See Warren Quinn, “The Right to 
Threaten and the Right to Punish,” in A. John Summons, Marshall Cohen, Joshua 
Cohen, and Charles R. Beitz, eds., Punishment: A and Public Affatrs Reader 
(Princeton. Univernmty Press, 1995), pp. 47-98, here p. 54; David Ross, The 
and the Good (New York: Oxford, 1930), pp. 57-60; S.L Benn and R.S. Peters, e 
Utilitarian Case for Deterrence,” in Dand M. Adams, ed., Piulosophical Problems in 


` the Law (Belmont: Wadsworth, 1996), pp. 416-20, here pp. 416, 419-20; and Jean 


Hampton, “The Moral Education Theory of Punishment,” John Rawls “The Prac- 
tice of Punishment,” Margaret Holmgren, “Punishment as Resttution’ The Rights 
of the Community,” David Lyons, “Punishment as Retnbution"—all in Michael J. 
Gorr and Sterling Harwood, eds., Crime and Punishment. Philosophical Explorations 
(Boston: Jones and Bartlett, 1995), pp 356-73, here p 372; 336-42, here p. 338; 
402-17, here p. 402; 317-21, here p. $17-—— respectively. 
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takes his legal revenge upon the former. The vindictive pleasures ac- 
companying legal punishment can either increase the value of any 
other form of payment received, or, where the criminal is destitute, 
can actually take the place of rectiving financial compensation. 
When a debt has been fairly determined and properly paid, both the 
victim and the victimizer can be (somewhat) restored, though in dif- 
ferent ways, to their pre-crime status as equals. My hope is that by the 
end of this easay, the reader shall see, contrary to the assumptions of 
much of the philosophical tradition, that justice and revenge are not 
antithetical. 
L FAIRNESS, DEBTS, AND EQUILIBRIUM 

Claims about fairness, debt payments, and the restoration of a pre- 
crime equilibrium are not new to the discussion of punishment. Ret- 
ributivists often speak of the criminal “owing a debt” or “having to 
pay for his crime.” But these metaphors belong to the vocabulary of 
restitution and do not cohere very well with the language of retribu- 
tion. Inevitably, a tension will result because retributivism is con- 
cerned with primarily giving the wicked what they deserve, 
punishing them just because they intentionally did something 
wrong. Given this emphasis, it is no surprise that retributivists who 
speak of “debts to be paid” will not do justice to such restitutionist 
concepts. When they mention debts in their writings, they often 
leave it unclear who the creditors are or the nature of the currency 
in which they are to be paid. 

We shall see shortly that Herbert Morris? is one well-known ret- 
ributivist who uses the language of debt but fails to locate the main 
creditors or the proper currency for the payment. F. H. Bradley,* on 
the other hand, gives lip service to the language of restitution, but 
then completely ignores it. Despite his carelessness, since he is the 
“purer” retributivist, I shall turn first to his ideas in order to high- 
light a major oversight and shortcoming of retribution. Then I shall 
explore Morris's work and point out some holes in his quasi-restitu- 
tionist account. In filling these gaps, I shall present an alternative, 
more robust account of how punishment can be considered a debt. 

Like most pure retributivists, Bradley pays too much attention to 
the evil of the criminal’s thoughts, intentions, and will, and ignores 
the evil that is the harmed condition of the criminal’s victim. For 
Bradley, it is not the evil consequences of the criminal act that must 
be offset by punishment but the evil embodied in the criminal him- 


* “Persons and Punishment,” The Monast, v, 24 (1968): 477-5601. 
* Ethical Studies (New York: Oxford, 1935, 2d ed.), p. 26: “We pay the penalty be- 
cause we ows it, and for no other reason” (italics added). 
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self. The emphasis just on the evil residing in the criminal, which is 
somehow “annulled” by the punishment, makes it seem as if Bradley 
does not believe, or rather overlooks, the real reason that the crimi- 
nal’s evil thoughts, intention, and will are so abhorrent. the actions 
resulting from these mental events bring considerable harm. Punish- 
ment should not lose sight of this harm. That is why it should include 
the paying of a debt that offsets the crime's evil consequences. If the 
harm wrought by the crime is kept in mind, the insistence on just re- 
moving the embodiment of evil within the criminal will appear quite 
unsatisfactory. That Bradley is guilty of such a narrow obsession 18 evi 
dent in his frequently quoted panegyric to retribution reprinted be- 
low. There he offers a truncated rationale for punishment in which 
the victim of the crime goes completely unmentioned. 


The wrong cries out for obliteration...if he can remove it it is his busi 
ness to do so.... The destruction of the wrong, whatever be the conse- 
quences, and even if there are no consequences at all, is still a good in 
itself...punishment is the denial of wrong by the assertion of nght, and 
the wrong exists in the self, or will, of the criminal, his self is a wrongful 
self, and is realized in his person and possessions; he has asserted in 
them his wrongful will, the incarnate denial of mght,; and in denying 
that assertion, and annihilating, whether wholly or partially, that incar- 
nation by fine, or imprisonment, or even by death, we annihilate the 
wrong and manifest the right, and this as we saw, was an end in itself, so 
punishment is also an end in itself (id, pp. 27-29). 


While it is not obvious exactly how punishment “annihilates the 
wrong and manifests the right,” it is rather clear that Bradley's ap- 
proach offers little benefit to the victim of the crime. Morris, on the 
other hand, does not focus only on the criminal’s wickedness. He 
discusses the losses a crime imposes on others and what sort of debts 
must be paid for the restoration of equality. But, as we shall see, he 
does not do justice to our intuitions about victims, fairness, debts, 
and equilibrium. 

Morris believes that members of society accept the restraints of the 
criminal Jaws so they can obtain the benefits of others’ restraint. In- 
dividuals do not violate the property and person of others in order 
to have the same protection reciprocated. Morris insists that, if a 
man does not restrain himself from wrongdoing while others do, he 
obtains an unfair advantage. His illicit gains must be offset by pun- 
ishment. The burden imposed by punishment replaces the burden 
he wrongfully avoided when he disregarded the law. Morris says the 
criminal owes a debt to those who did hold themselves in check (op. 
cit., p. 478). 
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The reader might wonder why law-abiding citizens should be com- 
pensated for restraining themselves? They each have a duty not to 
harm innocents. If someone else does harm an innocent, why are 
they (those who restrained themselves) owed anything? Further- 
more, even if we assume that it is unfair to them that the criminal 
did not restrain himself, how does his suffering compensate them 
for what they have lost out on? They are not receiving any tangible 
benefits to offset their extra burdens. The answer must be that Mor- 
ris just believes that since the law abiding have such a strong interest 
in the fair distribution of burdens, they will just fee] better, and thus 
it could be said that they have been compensated, when someone 
who dodged his fair share of burdens eventually receives it. 

There are a number of other problems with Morris’s account. First 
of all, he seems to overlook the real victim of unfairness—-and that is 
the person directly harmed by the criminal—the person robbed or 
raped or the like. It is she who has lost the most due to the criminal 
transgression. Her right to control her own person or property has 
been violated. Furthermore, the punishment of her victimizer does 
not restore an equilibrium of benefits and burdens between herself 
and him. While the criminal receives through his punishment a bur- 
den that he unfairly avoided by not restraining himself, and thus 
comes to have nominally the same burdens as nearly all of the law- 
abiding citizens, the actual victim of his assault or deceit is still left 
far worse off than he. She has not received any tangible compen- 
satory payment for the burdens his wrongdoing unfairly placed on 
her. 

Furthermore, Morris seems actually to overlook the real harms 
criminals bring to the rest of the law-abiding society while wrongly 
describing as harms certain features of a number of their actions. 
The latter mistake is due to the fact that there are countless crimes 
that most people do not want to commit. Few people have a desire to 
commit treason, beat old ladies senseless, or molest infants. It is not 
difficult for the majority of people to restrain themselves, for they 
are not at all tempted by the above and thus they do not suffer an 
unfair burden when the rare individual, that is, the criminal, ex- 
empts himself from the restraints that Morris wrongly seems to be- 
lieve weighs so heavily on law-abiding citizens. 

The real unfairness to the law-abiding citizens is not that they re- 
strained themselves and the criminal did not, but that they must pay 
a cost for their safety. They pay high taxes so the police can capture 
wrongdoers. Knowledge of crimes done to others even leads people 
to buy security systems and insurance for their own homes and prop- 
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erty and to restrict the time and manner that they can go certain 
places. This same knowledge leads them to live with increased anxi 
ety for the safety of themselves and loved ones. And if the law abid- 
ing are especially sensitive, it can also be claimed that another harm 
is done to them when they suffer vicariously with strangers who are 
crime victims. Given the above-mentioned costs that the law abiding 
must bear, it would seem that merely bestowing upon any criminal 


the burdens he avoided provides inadequate compensation. 
Il DEBTS FROM THE DESTITUTE 


So a debt theory requires not only a greater but perhaps a more tan- 
gible payment than that provided for by Morris’s account, and, most 
importantly, one that goes from the criminal to his direct victim, that 
is, the person he defrauded, attacked, and so on. Given this require- 
ment, the problem for many restitudon-minded retributivists, or 
their “pure” restitutionist cousins, is to explain how punishment can 
be called a debt payment in those cases where the individual victim 
or injured public is not receiving any money, goods, or services from 
the criminal. Many crimes are neither thefts of wealth nor involve a 
guilty party that is able to compensate his victims economically for 
the nonfinancial pain he caused them. 

It may be argued that the impoverished victim should still have his 
illicit gains offset by a jail term. Such punishment is often explained 
not by the standard retributivist maxim ‘that the wicked deserve pun- 
ishment in proportion to their evil’, but that ‘no one should benefit 
from committing a crime’. Although this cannot be easily thought of 
as a debt payment, perhaps one could understand the punishment 
as a payment of sorts to others—most notably, the direct victim (s)—~ 
who have an interest in the reestablishment of fairness. What they 
are bothered by is not the Morms-like unfairness in which the crimi- 
nal avoided burdens that they bore; rather, they just find it unfair 
that a criminal should keep the rewards of his misdeeds, for he does 
not deserve such pleasures. The direct victim is especially disturbed 
by someone’s illicit gains that came at his expense. In the present 
discussion of the criminal destitute at the time of his capture, the 
felon’s illicit benefits are only psychological. Any sadistic or other- 
wise ignoble satisfactions experienced during a crime or later relived 
in memory, as well as subsequent pleasures that are made possible by 
the transgression (for example, the joys of drugs, parties, vacations, 
and so on brought with the now squandered stolen money), could 
be offset by the psychological discomforts of a harsh confinement. 
When the pleasures in which the criminal had no right to partake 
are “subtracted” by the disutility of the punishment, he will be re- 


84 THE JOURNAL OF PHILOSOPHY 


turned to, approximately, the level of utility he was at prior to his 
crime. The obvious result of this is that the criminal does not benefit 
from his crime. 

But the principle that a lawbreaker be denied any advantages from 
a crime would not, by itself, always guarantee a prison term. Where 
the criminal did something that the law abiding had no desire to do, 
and if he were apprehended in the act or if the rewards of the crime 
were taken away before he could enjoy them, then since he did not 
benefit from his crime, materially or psychologically, a jail sentence 
will not serve to offset any gains. Any principle that leaves us without 
grounds for incarceration is obviously unsatisfactory. 

Even if the criminal did not receive any unfair advantages from his 
crime, there is no denying, despite the absence of any tangible restitu- 
tion, that most directly victimized people, and the public at large, care 
about a criminal’s comfort level. Their concern is in seeing it reduced 
by imprisonment. Moreover, it appears safe to assume that this is not 
merely because of either the superior deterrent effects that punishment 
brings or that incarceration is Instrumental to the reform of the pris- 
oner. If the motivation were just deterrence and/or reform, then, if for 
some reason this could be achieved without offsetting the psychological 
benefits the criminal received from his conduct, the interest in erasing 
such wrongful gains, or in their absence, just imposing burdens on the 
criminal, should vanish. But such interests are unlikely to disappear. 

So an answer still has to be given regarding why it is that most of 
us want to do something about the indigent’s criminal’s level of psy- 
chological well-being when he did not benefit materially nor psycho- 
logically from or during his crime, and only avoided lawful 
restrictions that we, the law abiding, did not feel to be at all burden- 
some. Because of such cases, it so far looks as if a debt theorist does 
not have any grounds for demanding imprisonment of the destitute. 
But a debt theorist might protest that the indigent convict’s punish- 
ment can consist of his being compelled to take part in some kind of 
public works that benefit all of society, including the direct victim. 
This will not do any good, however, if we assume the inmate is too 
frail to labor. And even if the convict could work, in most cases, his 
lack of skills would not offset the costs of his confinement. 4 More im- 
portantly, even if the criminal’s labor could pay for his room and 
board, the victimized are not likely, in most cases, to consider the 


* The problem of the inability of the unskilled to provide financial com 
tion plagues Randy Barnett’s economic libertarian approach to restitudon— 
tution: A New Paradigm of Criminal Justice,” reprinted in Gorr and Harwood, pp. 
392-401. 
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standard jobs done by prison work crews to be sufficient compensa- 
tion for their suffering. 

Some readers might think that a nondebt, nonreform, nondeter- 
rence-based justification for punishing those who are indigent can 
be found in Joel Feinberg’s® account of expressivism. My contention, 
however, is that the expression of condemnation can accompany pun- 
ishment that is primarily the paying of a debt, but it is a mistake to see 
the hardship of punishment serving primarily as a “conventional de- 
vice for the expression of attitudes of resentment and indignation, 
and of judgements of disapproval and reprobation” (iid, p. 98). I 
find what Feinberg calls the “symbolic significance” of punishment 
to be insufficient, for it is unable by itself to justify the length and 
severity of sentences that we believe proper. Feinberg claims, how- 
ever, that: 


given our conventions, of course, condemnation is expressed by hard 
treatment, and the degree of harshness of the latter expressed the de- 
gree of reprobation of the former; still this should not blind us to the 
fact that it is social disapproval and its appropriate expression that 
should fit the crime and not hard treatment (pain) as such. Pain should 
match guilt only as its infliction is the symbolic vehicle of public con- 
demnation (id, p. 118). 


Despite what Feinberg says, the guiding principle of his theory 
seems to render the hardships of punishment gratuitous. He himself 
admits the hypothetical possibility of a society with an “elaborate 
public ritual, exploiting the trustiest devices of religion and mystery, 
music and drama, to express in the most solemn way the communi- 
ty’s condemnation...rendering superfluous any further bard physical 
treatment” (tid, p. 116). Even though our society lacks such a pub- 
lic ritual, most criminals are quite aware that their fellow citizens re- 
sent them and disapprove of their conduct. Even those rare 
individuals who are unaware of this are unlikely to need to serve 
twenty years behind bars with the four walls of their prison cells serv- 
ing as blackboards on which society writes its lessons on condemna- 
tion. 

Instead of emphasizing the severity of the punishment as an ex- 
pression of society’s condemnation of the criminal, Feinberg can still 
insist that, if the criminal were not punished, it would express a mes- 
sage that the victim and his rights are not important Feinberg in- 
deed endorses something like this in his discussions under the 


* “The Expressive Function of Punishment,” m his Dong and Deseroing (Prince- 
ton: University Press, 1970), pp. 95-118. 
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headings of “authoritative disavowal” and “the vindication of the 
law." He writes: “What more dramatic way of vindicating (a man’s) 
violated right can be imagined than to have a court forcibly con- 
demn its violation through the symbolic machinery of punishment” 
(ibid., p. 104)? But this sll will not justify maintaining sentences that 
are extremely harsh. Society can just as well express its concern for 
the victim and his value through an outpouring of sympathy, regret, 
and various types of compensation and assistance. And it should not 
be thought that the reason we do not pursue this expressivist path is 
just that it would bankrupt the state. Even if the funds were available, 
there would still exist the need to punish those who victimized us or 
others. 

So it does not appear that expressivism can justify our desire to 
impose severe sentences upon the worse criminals. We saw that 
the retributivist defends harsh sentences by insisting that it is just 
good that the evil are punished even if the victims do not benefit 
psychologically or materially from the punishment of the crimi- 
nals who victimized them. Others, most notably J. F. Stephens,’ 
doubt that the interest in retribution can be separated from the 
desire for revenge. He famously wrote: “The criminal law stands to 
the passion of revenge in much the same relation as marriage to 
the sexual appetite” (ibid, p. 99). My own view is not that retribu- 
tion is really revenge, though often disguised.’ Rather, I object to 
retribution because it provides an insufficient rationale for pun- 


ishment. 
I RESTITUTION AND REVENGE 


My hope is that we can come to see the punishment of both those 
who gained from their crimes and those who did not as in some way 
equivalent to the repaying of a debt that makes the injured members 
of the public wealthier in some sense.® But again, what benefit could 
the harmed party receive when those to be punished are already im- 
poverished? It will sound very strange, even outrageous, but I want 
now to develop the idea that there is a place for revenge in a restitu- 
tionist framework for punishment. This is at odds with a long tradi 
tion of moral and legal philosophy which has always started with the 


* The General View of the Crominal Law of England (London: Macmillan, 1868). 

7 A plaumble example of nonvindictive retribution is when an adult reluctantly 
informs the police of the crimes and whereabouts of a beloved family member. He 
is clearly not getting any pleasure bringing his dear relative to fustice. 

* It should not be thought that a wrongdoer cannot be in debt to a person umr 
leas he received some sort of benefit from harming him. The model of a loan and 
its repayment is here misleading; for a person can destroy the property or peace of 
mind of another and thus have to make restitution, even though he never ob- 
tained any pleasure through the illicit action. 
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virtually axiomatic assumption that justice and revenge are diametri 
cally opposed. Nevertheless, I find it reasonable to raise the question 
whether a punishment that provides little or no payment of any kind 
to the victim will be sufficient for most people.” There was the nor- 
mal state of affairs, it was disturbed by a crime, then the criminal’s il- 
licit gains, if any, were withdrawn or offset. But this does not restore 
the status quo, for the deep wound and surrounding scar tissue, 
along with the pique, petulance, and resentment felt by the victim- 
ized, remain untreated. 

Even if the crime were unsuccessful and the victim experienced 
little loss other than anxiety (and perhaps the money he would then 
spend on security measures), the victim would still be left greatly 
bothered by the arrogance and contempt the criminal showed to- 
ward him, for it was these very states of mind that initially cleared 
the way for the villain’s transgression. If the criminal had more re- 
spect for his target, he would not have been able to violate the lat- 
ter’s wishes through his attempt. So, even if the attempt failed, it 
still implied the existence of certain attitudes that would insult and 
enrage the intended victim. 

Of course, it is not just the disrespect the criminal feels for the vic- 
tim that motivates the latter to retaliate. There is in many cases the 
actual physical, psychological, or financial suffering that the law- 
breaker caused. There exists a primitive need to get back at those who 
hurt us in these ways. Acting on this urge is often followed by a cry of 
“Now we are even!” My thesis is that the vindictive satisfaction that 
comes from “getting even” should be considered the debt payment 
that the victim receives from the criminal. 

But this equality does not come about merely by removing the 
gains of the villain as was the case in the earlier discussion of the 
‘no one should benefit from their crime’ maxim or the unfair ad- 
vantages principle that Morris defends. The vindictive person ac- 
tually feels much better when he exacts his revenge. Revenge 
(re)creates an equality between the original victim and original 
victimizer not just by lowering the well-being of the latter but by 
raising that of the former back to or near the status he enjoyed 
prior to being victimized. This is the most important equilibrium 
we seck, even if the satisfaction obtained through the harshest re- 
venge still leaves the original victim somewhat beneath his pre- 


* The victim of even a very wealthy criminal is likely to desire not only a great 
deal of financial compensation but also that his tormentor experience the discom- 
forts of prison life. The existence of such a desire, assuming it also is defensible, is 
an additional difficulty for Barnett’s scheme. 
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crime status.!° When the criminal is lowered to the state the victim 
was at before the latter’s compensation, an equality is restored be- 
tween the criminal and the victim. Both have suffered equivalent 
harms. This equilibrium is very important in their relationship, 
more so than offsetting the illicit gains that merely restore the 
criminal to his pre-crime status. This equality (or near equality) of 
harms is necessary for the two parties ever to resume a relation- 
ship as fellow citizens, each having a right to a certain minimum 
of formal] respect, privileges as well as duties. If the criminal has 
not been sufficiently harmed, those he directly (and indirectly) 
victimized, will not ever be able to deal with his return to civil soci} 
ety. When a debt has been fairly determined and properly paid, 
that is, the vengeance has been exacted but not in a way that is ex- 
cessive, both the victim and the victimizer can be (somewhat) re- 
stored, though in different ways, to their pre-crime status as 
equals. And it is this restoration of equality, with all its psycholog+ 
cal benefits, that justifies the debt theory. Thus it would be a mis- 
take to understand either the criminal’s suffering, the vindictive 
pleasure the victim obtains from it, or any other debt payment in 
and of itself as a justificatory primitive, that is, the ultimate princi- 
ple at the end of a chain of justifications. 

So the victimized individual and/or society is unlikely to forgive. 
and then accept the return of the criminal as a fellow citizen if it is 
not psychologically compensated to some extent, emotionally re- 
plenished after what it has lost in pleasure, security, wealth, liberty, 
and respect. Even if the victim is exceptionally forgiving and seeks to 
drop the charge without a debt payment, the criminal cannot go 
free. First, this much too quick offering of forgiveness is really such a 
corrupt form of forgiveness that the debt theory offered here does 


* But what if the victim m murdered? How can punishment benefit the dead 
when they obviously cannot experience anything? Will not the only vindictive ben- 
efit be that of his surviving family? Perhaps not. Consider the possibility that a per- 
son could have his interests frustrated without consciously experiencing their 
frustration and thus his level of well-being lowered without his knowing it Exam- 
plea of such non ential harms are when one never discovers infidelties of a 
spouse or that physically distant family members one believed to be thriving really 
were not. If the reader acknowledges that, if he were in these situations, his life 
would not actually be going as well as he thought it was, then he believes in nonex- 
periential harms. If there are nonexperiental harms, then there can be non 
ential benefits. And if a living person can be harmed or benefitted without 
knowing it, then there is not any reason why a person cannot be posthumously 
harmed or benefitted. Thus the murder victim can have his interest in revenge and 
justuce satisfied or denied by what happens after his death. For a more detailed ac- 
count of nonexperiential and posthumous harm, see Feinberg’s Harm to Others 
(New York: Oxford, 1984), pp. 79-89. 
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not need to be modified to account for it. Perhaps a person who for- 
gives too quickly has such low self-esteem he does not believe it was 
that bad for him to be treated as he was. Or the corrupt motivation 
for prematurely granting forgiveness is the victim really wants to for 
get rather than forgive. The mental discomfort that the crime 
brought—and perhaps the continued state of anxiety resulting from 
an ongoing conflict—is too hard to bear, so the victim “forgives” his 
foe in order to have peace of mind. Second, the criminal cannot f- 
nancially compensate all of society who must pay (in some sense) for 
the loss of their “peace of mind” as well as in the conventional sense 
for their own personal security measures and the criminal and legal 
systems that pursued the wrongdoer. They thus have the right to rel- 
ish his justified suffering even if his direct victim immediately “for- 
gives” him. 

Therefore, unlike a debt theory that only emphasizes the direct 
victim, my theory does not need to bring in the extraneous principle 
of deterrence to justify punishment where the immediate victim has 
prematurely “forgiven” the criminal. And since the debt theory does 
not rely upon deterrence as a justification, people are not used 
merely as a means for society's benefit. Moreover, this instrumental 
ist justification for punishment, which many find distasteful, can be 
avoided without a loss in deterrence. The debt theory can indirectly 
deliver about the same amount of crime prevention as we get if we 
understand the present system to be a prevention-based one of using 
men as examples to deter others. 

So allowing the satisfaction of vindictive feelings serves as a deter- 
rent and enables the victim and society to accept the criminal’s re- 
turn once they have received their “just fill.” Now this “just fill” 
should not be thought of something separate from punishment, 
which is how we currently understand compensatory damages 
through which society benefits the victim. Unlike most restitution- 
ists, I do not distinguish punishment from restitution. I do not be- 
lieve, as Randy Barnett’! advocates, that restitution should replace 
the system of punishment nor that restitution should be added as 
something over and above punishment. In my theory, punishment 
and restitution are not separable like a jail sentence and a fine but 
are one and the same. The satisfaction that the victim derives from 
the incarceration of his victimizer can be the full payment of the 
debt—or if accompanied by financial damages, it can still be consid- 
ered a form of compensation in and of itself. 


" In Gorr and Harwood, p. 398. 
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IV. THE PSYCHODYNAMICS OF REVENGE AND RESTITUTION 

Perhaps in the best of all worlds, people would not have vindictive 
feelings. If our natures were different, so would be our system of 
punishment. But since people do have vindictive feelings, it is pru- 
dent to channel them in a productive, civilized way. There already 
seems to be within men and women a desire for such judicial vindic- 
tiveness to restore the moral symmetry that existed between individu- 
als prior to a divisive injury. The egalitarianism of this passion is 
evident in the claim often accompanying revenge: “Now we are 
even!” This judicial vindictiveness can be distinguished from a type 
of unjust vindictiveness in which a sadistic delight is experienced 
when a man is punished more severely than he should be and/or his 
deprivation is not accompanied by a recognition of his right to some- 
day be restored to civil society as an equal. 

If judicial vindictiveness did not exist, a public infected by the 
other type of vindictiveness would prefer the criminal to find his 
every day in jail as unbearable as possible. But if the wrongdoer ac- 
knowledges his behavior to indeed be misconduct, sees his punish- 
ment as a necessary debt, a token offering to his victims that is also 
essential to his own expiation, then his stay behind bars becomes 
more tolerable than it would be otherwise. In my estimation, most 
aggrieved parties would prefer to see such remorse in the felon. 
They wish him severely disadvantaged but, nonetheless, desire him 
to agree to his confinement. Were it only pain that they sought him 
to undergo, there would be no interest in his serving an appropri- 
ate sentence or recognition of his right to be eventually restored to 7 
society. g 

Some readers might have experienced a desire parallel to what I 
am calling judicial vindictiveness in their interpersonal exchanges 
outside of the legal system. If injured in some way, they did not per- 
mit the restoration of relations until their victimizers had “paid” 
some type of nonstandard debt in the form of misery, guilt pangs, or 
even by being the recipient of a melodramatic slap to the face. Only 
then arose the feeling that the wrongdoer had suffered enough and 
could be forgiven and restored, or at least brought closer to his pre- 
vious status, 

On the other side of the equation, the psychodynamics of contri 
tion may be such that the wrongdoer will feel relieved when his vic- 
tims or their grieving families are satisfied by harsh justice or even 
revenge being served. Many readers have probably felt at some time 
much better when after a transgression they suffered some form of 
sanction, perhaps even bodily pain when they were youngsters, 
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which enabled their spirits to improve as a parallel improvement oc- 
curred in the mental states of those who demanded the punishment. 
In old movies, this often takes the form of one macho but contrite 
fellow allowing the party he earlier wronged to punch him squarely 
in the nose or jaw to even the score. The guilty often feel less so 
when their own suffering satisfies those whom they had earlier vic- 
timized. They atone for the crime by accepting such hardships.'’* The 
debt theory can be considered a form of atonement because (1) the 
payment provides the criminal an opportunity to make amends for 
his misconduct; (2) his willingness to make good on his debt sup- 
plies evidence that he feels remorse; and (3) the reduction of what 
he owes his victim can also reduce the pangs of his conscience. 

We can even speak sensibly of the criminal having a right to be 
punished in a culture where punishment is seen as a guiltremoving 
debt—though in most cases, of course, the criminal would waive this 
right. The right to be punished is a rather paradoxical way of talking 
initiated by G. W. F. Hegel though hinted at by Immanuel Kant. 
They both felt that not punishing the criminal would be to overlook 
his status as a free and rational citizen whose will is embodied in the 
law. He was at liberty not to commit the crime, so to withhold pun- 
ishment is to ignore his free agency and to show him disrespect.” 

This still is not the full story, however, behind why a man might want 
to be punished. In the atonement/debt version, what is stressed is the 


* This acceptance of punishment reveals an advantage the atonement/debt the- 
ory has over approaches that stress the reform of the criminal. If reform were the 
raison d’étre of punishment, then, at least in pnnaple, some men who commit 
heinous crimes could avoid fail or be released after only a brief imprisonment, if 
shortly after their crime they undergo a dramatic change in moral character This 
problem of early release plagues Jean Hampton’s “The Moral Education Theory of 
Punishment,” in Gorr and Harwood, pp. 356-75, But in the atonement/debt theory, 
the remorseful prisoner’s acceptance of the length of the standard sentence as a 
debt repayment u proof that he z reformed He atones by willingly serving his full 
sentence, If he seeks to serve less than the appropriate tme, he 1s attempting to 
avoid paying his debt in its entirety, which 18 something a genuinely reformed and 
remorseful person would not do. So there is neither an epistemological problem of 
deading who has been morally transformed and thus should be promptly freed, 
nor any quesnon of unfairness duc to different sentences for the same crime. 

> Morris argues that we can see that the cumumal has a night to be punished on ret 


cents. Morris assumes retribunon gives a greater control over his life since he is 
not harmed except in response to hus freely made decisions. But this i questionable. 
Conader the possilihty that framing or treating people lowers the crime rate so much 
that the probability that any type of harm will befall a man m this hypothetical nonret 
ributivist system is less than it would be in a retributrvist society. Thus a man would have 
more control over his life in a society with such institutions So Morris’s attempt to base 


the nght to punzshment on the greater autonomy 1s potentially self-defeating. 
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criminal’s right to pay a debt so he can be forgiven and restored to the 
community as an equal. His willingness to accept the punishment re- 
veals his genuine regret, and this makes him more likely to be accepted 
when he is eventually released. If a guilt-stricken criminal turns himself 
in to the authorities and demands punishment, it is more likely that he 
has in mind the atonement form of a right to punishment than the de- 
mand that his freedom and rationality be respected, as advocated by 
Kant and Hegel.’ He intends through the paying of a debt to lessen his 
own guilt and the hostility that the society at large feels toward him. His 
wish is to belong once again to a community of equals, which he real- 
izes cannot be fulfilled unless he is punished. 

A debt theory that allows the satisfaction of vindictive feelings to as- 
suage the victim’s sense of having been wronged even sheds some light 
on the much discussed problem of moral luck.” I have in mind here 
not the bad moral luck of having a certain disposition or being in cer- 
tain circumstances others avoided.'® Rather, I am concerned with the 
luck that determines whether one is punished harshly for a successful 
crime or more leniently for a merely attempted but unsuccessful crime. 

It strikes many philosophical observers as odd, for example, that 
the man charged with attempted bodily harm is treated more le- 
niently than the man arrested for successfully committing the same 
crime, though both villains harbored the same evil intentions. How 
can the retributivist account for punishing attempts less severely than 
successful crimes?!’ In both cases, the person is just as evil. Further 


* But it should not be thought that the two particular forms of the right to be 
punished are incompanble. 

€ See Bernard Williams, “Moral Luck,” in his Moral Luck: Philosophical Papers, 
1973-1980 (New York: Cambndge, 1981), pp. 20-29; and Thomas Nagel, “Moral 
Luck,” in his Mortal Questions (New York: Cambridge, 1979), pp 24-39. 

» For a list of different ways in which “the natural objects of moral assessment 
are disturbengly subjected to luck,” see Nagel, pp. 28-29 

1! The advocate of deterrence cannot argue the distinction between success- 
ful and unsuccessful attempts came about or is fustfied by the belief that different 
sentences will motivate a criminal to stop before his attempt is concluded or will be 
an incentive not to shoot again after first missing the intended victim. In the latter 
case, if the suffer penalty for success did not deter in the first place, it is unlikely to 
prevent a second attempt where the criminal has even more incentive to try again. 
The additional motrvation is that he is more likely to be hended and con- 
victed since his unsuccessful attempt leaves alive wimesses to his misconduct. Given 
the criminal’s even greater incentive to make a second attempt than a first, the 
“logic of deterrence” suggests that the penalty for the second attempt—even if um 
successfulshould be harsher than the penalty for a successful first attempt. But we 
surely are not going to be comfortable with a system that allowed success on the 
first try to be punished less severely than two-time failures. For instance, the fami 
lies of murder victums will probably be repulsed by the idea that the Killers of their 
loved ones should be punished less severely than other cnminals who have done 
less harm because they have failed ın their attempts to kill someone. 
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more, if we adopt this retributivist rationale purged of any vindictive 
payback for actual harm, then, in principle, if we could read minds in 
a high-tech possible future, we might be obligated to punish those 
who would like to commit crimes but do not ever begin to set such 
events in motion for they fear punishment or cannot see how to over 
come other solely practical obstacles to the success of the desired 
crime. I say this because I do not see how most retributivists can draw 
a line distinguishing those with such evil thoughts, but who are inac- 
tive, from those who act on the same type of thoughts. The former 
are just as evil as those who actually break the law thinking they can 
avoid capture.’® It is just nonmoral, contingent factors that keep such 
people from attempting crimes. And, likewise, it is just nonmoral, 
contingent factors that keep attempted crimes from being successful. 

I do not believe, however, that most people care enough about the 
intrinsic qualities of others’ mental states to punish those with evil 
thoughts just because their thoughts are evil. Or, at least, they do not 
care to punish them as severely as those who are no more evil but do 
more harm.” So it is doubtful that a retributivist principle like that 
of Bradley’s, which calls for the “annihilation of the evil” embodied 
within a corrupt man, can provide a good account of not only our 
actual practices, such as punishing failed attempts less severely, but 
of our attitudes to counterfactual scenarios. 

The leeway given to vindictive feelings provides a way, however, to 
explain the asymmetrical punishment. The man who commits a vio- 
lent crime is punished more severely than someone who tried but 
failed to commit the same crime, since the greater harm the former is 
responsible for evokes a greater need in the victim, her friends, rela 
tives, and the society at large to retaliate in order to even the score. 
That is, since they were hurt more, they need more compensation.” 


* Only Morrisstyle retributrvists can justify punishing those who are evil and act 
on their evil, while refraining from punishing those who are equally evil but do not 
act. This is because the latter do not forsake the burdens of restraint and obtain 
unfair advantages with their criminal attempts. 

* Moreover, why not entrap the wicked on retributrvist grounds? This question 
was Taised—perhaps only in partial senousness—by Frances Myrna Kamm in con 
versation. 

™ If some people would insist upon punishing the inactive possessors of these 
distasteful mental states, it may be because many of them want to retaliate for the 
animosity, indifference, or some other form of disrespect they directed toward 
them. This ıs a form of revenge and not retribution. 

= It is worth noting that, traditionally, the area where moral luck has bothered 
us the least has been in matters of compensating victrms. For mstance, if two 
mean-spinted vandals break into different gallenes and each destroys an artwork 
whose value they were unaware of, it does not strike us as unfair that the one who 


unwittingly destroyed a more valuable painung must make a greater debt payment. 
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Autonomous Agents: From Self-control to Autonomy. ALFRED R. MELE. New 
York: Oxford University Press, 1995. viii + 271 p. Cloth: $55.00. 


The perennial nature of debates about human action, akrasia, and 
free will are such that one suspects either that everthing of value has 
been said on the issue or that we should question the parameters 
and background presuppositions within which they are conducted. 
Alfred Mele admirably surpasses the former limitation while making 
suggestive remarks which, if elaborated, might lead to the latter. His 
goals in this book are straightforward: to give an account of the ide- 
ally self-controlled agent, an account that accepts akrasia as a gen- 
uine possibility; to argue that additional conditions must be added to 
those of an ideally self-controlled agent for her to be autonomous; to 
lay out sufficient conditions for autonomy; and to argue that, with 
slight variations, such an account can be consistently adopted by 
both compatibilists and libertarians. His arguments are clear and his 
strategy perspicuous (even if his prose becomes a bit tortured in the 
service of precision and detail). Iam not sure he is successful at each 
of these endeavors, but his efforts surely provide a worthy addition to 
debates that have been thought by some to have atrophied long ago. 
Mele’s first enterprise is to give an account of intentional action 
that allows “decisive best judgment” to figure prominently. The 
problem he addresses concerns how explanations of intentional de- 
liberative action can retain a conceptual or causal tie between judg- 
ment and intentional action but allow for cases where we act against 
our decisive better judgments. Mele’s contribution to this puzzle is 
to argue that the phenomenon of self-control figures systematically in 
intentional action, and akrasia occurs when intervening acts of self- 
control fail to occur. Hence, even if judgment is understood in the 
motivational (rather than purely intellectual) sense, akratic action is 
possible because there are times when efforts at self-control fail to in- 
tervene between the judgment of what’s best and the competing 
(possibly stronger) desire to do otherwise (chapters two-four). Mele 
also applies this model of selfcontrol as an intervening factor in in- 
tentional action to cases of belief formation, where agents must over- 
come a variety of common cognitive biases, as well as processes of 
emotional control (though Mele by no means identifies “best judg- 
ment” with “reason”) (chapters five-six). With these arguments in 
hand, Mele constructs a picture of the ideally self-controlled person. 
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Perfect self-control is global and reaches overt actions, mental ac- 
tions, beliefs, emotions, and the like; it is always exercised in the ser- 
vice of decisive, nonakratically held, better judgments; it is exercised 
whenever appropriate; and it is always effective. 

The question, however, is whether such an ideally selfcontrolled 
person is autonomous. Mele’s answer is that she is not (necessarily). 
Despite being perfectly able to appraise critically and to act upon 
her desires and values, the perfectly selfcontrolled person may nev- 
ertheless be acting on desires and values that have been implanted 
into her by artificial means, ones whose development has bypassed 
normal reflection and awareness. In those cases, the person is self- 
controlled but not autonomous (121-26). 

Some might argue that autonomy is achieved as long as an agent is 
fully competent in turning her practical judgments into intentional 
action (though a fairly robust conception of “competence” might be 
developed on that score). Mele calls this an internakst conception. 
He argues, however, that there are external conditions required for 
autonomy. He motivates this by imagining an otherwise fully compe- 
tent deliberator who is able to act rationally upon values and desires 
she has critically reflected upon, but whose values have been incul- 
cated in her by a process of brainwashing undertaken against her 
will. If such a person is less than fully autonomous (and Mele claims 
that she obviously is), it must be in virtue of some factor external to 
the inner workings of her decision making and reflection, in particu- 
lar, relative to the etiology of the values upon which she reflects and 
acts. Moreover, Mele argues that heteronomy of this sort does not 
require that the processes of compelled manipulation be intention 
ally imposed on the agent (163-69). 

Mele goes on to argue that what must be true of the history of 
value formation (or any other characteristic relevant to autonomy) is 
that the person was not compelled to adopt the value or pro-attitude 
(at least not compelled in a way that she herself arranged). To be 
compelled is to be unable to shed the attitude in question and to 
have been conditioned to adopt it in ways that bypass the usual kinds 
of mental control normal agents enjoy—basic critical reflective ca- 
pacities as well as skills in self-control. 

Many will not be persuaded by Mele’s historical argument, per- 
haps for reasons similar to ones he puts forward himself in criticizing 
another historical theory of autonomy: aspects of the manner in 
which one developed one’s values are one thing, but one’s occurrent 
ability to act competently on them is another, and only the latter 
truly matters in the kinds of reactive attitudes, attributions of respon- 
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sibility, and the like that are the marks of autonomy. But, moreover, 
it is still unclear that the autonomy-defeating conditions Mele sets 
out—being compelled in a way that bypasses normal mental control 
to assess desires in light of reasons—do not obtain so generally that 
it makes autonomy in this sense an impossibility (especially if they 
need not be intentionally put in place for autonomy to be dis- 
rupted). Mele discusses the education of children, but argues that 
only in a narrow range of cases are the child’s capacities to consider 
choices in light of reasons enitrely bypassed, and in those cases we can 
say the child is robbed of an autonomy-producing upbringing. But 
we are all raised in relatively limited surroundings, in that we were 
not, could not have been, given access to information about radically 
alternative ways of life. Mele argues, in fact, that we can lack auton- 
omy when information that would crucially effect our decisions is in- 
tentionally withheld (180). So it is unclear how any of us is 
autonomous, given that, in many ways, everyone’s development takes 
place without full information about what it would be like to live in 
radically alternative environments (information or experiences 
which, if pursued, may have changed our values at the core). Con- 
sider reports by people who had the chance to live in another cul 
ture as a youngster and report how their values were radically 
changed as a result. What might establish the authenticity of our 
(necessarily) limited developmental experiences is that we do not 
retrospectively disapprove of them. But that is equivalent to a condi 
tion of historical autonomy that Mele, for other reasons, rejects (138-39). 

Mele’s initial formulation of autonomy is clearly a compatibilist 
one, in that all of the processes enumerated in the model can 
be causally determined, as long as they do not amount to 
compulsion (not arranged by the agent). But if an incompat- 
bilist libertarian were to insist that there must be some 
indeterministic element in the etiology of intentional acts for 
them to count as free (autonomous), then Mele has something 
to offer her as well (chapter twelve). To establish this 
indeterminacy, Mele focuses on that crucial juncture in decision 
making where certain beliefs “come to mind” during the process of 
deliberation. This is the indeterministic link that can occur amidst 
an otherwise deterministic process of deliberation and choice 
(captured by the compatibilist conditions of autonomy already 
described). On this view, a person has proximal control over 
how she deliberates given the beliefs she has in mind, and more 
than one physically possible future is open to her during delibera- 
tion due to this doxastic indeterminacy. So Mele presents alternative 
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conceptions of autonomy—one compatibilist and one libertarian— 
and this allows him to maintain an agnosticism about the final truth 
of compatibilism (251-54). 

In the end, however, I am not convinced that the picture of inde- 
terminacy Mele offers will do much to help the libertarian cause, 
and indeed it may show the crucial fissure in that position. His con- 
ception of “control” required by libertarian accounts of freedom 
(211) amounts to the requirement of some indeterminacy regarding 
action, establishing the central incompatibilist requirement that for 
freedom to obtain, more than one physically possible future must be 
possible for a person at the time of deliberation. But indeterminacy 
does not add up to control. Without a theory of agent causation 
(which Mele rightly thinks is implausible), it is utterly mysterious 
how an indeterminate process—whether in the formation of beliefs 
or desires or whatever—can amount to the kind of self-control re- 
quired for freedom. If the explanation for my action contains an ele- 
ment of randomness concerning belief formation, for example, then 
it does not matter, from my perspective, what the source of that be- 
lief is; it occurs to me “out of nowhere,” as it were. But if that is so, it 
would not matter if that belief were placed in my deliberative path 
by a clever manipulator (and if one insists on an element of indeter- 
minacy, just imagine that the manipulator merely arranges a high 
probability that I shall come upon the surreptitious belief). Since I 
certainly lack the relevant control in the latter case, why do I retain it 
in the case of Mele’s doxastic indeterminacy? 

Mele ends his book with a response to those incompatibilists who, 
unlike their libertarian cousins, deny the very possibility of auton- 
omy (chapter thirteen). In response to an argument of this sort by 
Richard Double, Mele argues that merely because there is inter- 
minable debate about the nature of a concept or the details of its 
conditions, as there is with concepts like “freedom,” that does not 
mean that such phenomena do not exist: consider the interminable 
debates about the nature of intentional action—we certainly do not 
conclude in that case that there is no such thing as intentional ac- 
tion (239-43). 

These observations begin to reveal the limitations of such detailed 
theorizing about the conceptual structure of complex ideas as a 
method of solving long-standing metaphysical conundrums. Mele 
masterfully winds his way through the complex pathways of this con- 
ceptual modeling, but in the end it may well be necessary to see con- 
cepts like “autonomy” and “freedom” as fully embedded in 
historically grounded social practices rather than free-floating ele- 
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ments subject to skillful dissection. Indeed, as Mele points out in his 
discussion of arguments for the impossibility of autonomy, it is often 
unclear which conception (of “autonomy,” “responsibility,” or the 
like) a theorist should choose in showing the metaphysical incom- 
patibility of that concept with other (equally malleable) concepts. 
One may just have to admit that the key notions in these debates 
may well be too blurry at the edges, and too variable across the dif- 
ferent institutional and social settings in which they are used, for any 
final determination concerning their compatibility to be made apart 
from analyses of those settings themselves. 


Pennsylvania State University 


JOHN CHRISTMAN 


A Philosophy of Mass Art. NOEL CARROLL. New York: Oxford University 
Press, 1998. xii, 425 p. Cloth $77.00, paper $18.95. 


With bis new book, Noël Carroll intends to baptize mass art—art 
produced and distributed for consumption at multiple and discrete 
sites of recepiion—as a respectable subject for systematic aesthetics. 
Toward this end, he exhaustively surveys the entire range of topics 
related to mass art which might be of philosophical interest. After 
opening chapters on twentieth-century antagonists and friends of 
mass art, Carroll advances his own definition of the concept, to 
which I shall turn below. Through a dialectical engagement with the 
competing theories advanced by David Novitz and John Fiske, Car- 
roll then spins out the ontological and epistemological implications 
of his position. Finally, his definitional work completed, Carroll adds 
three chapters applying recent research on the relation between art 
and, in order, the emotions, morality, and politics, to mass art. 

As this summary indicates, A Philosophy of Mass Art is a large-scale 
undertaking, ambitious both in providing an overview of the state of 
philosophical work on its subject and in pointing out future direc- 
tions for inquiry. At the heart of Carroll’s project, however, there isa 
puzzling piece of advice to his fellow aestheticians: we ought to take 
this program seriously because we ought to pay more attention to 
mass art. This is puzzling because Carroll also claims, quite reason- 
ably, that these days all people are aswim in a sea of mass art and so 
cannot help experiencing it. If we add the likely premise that aes- 
theticians are people, too, this second claim entails that we are al- 
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ready paying attention to mass art all the time—and if that is true, 
then Carroll’s advice seems otiose. 

Of course, Carroll intends the claim that mass art is the dominant 
art of our time to be a justification for the claim that aestheticians 
ought to be interested in it: we ought to pay more attention to it be- 
cause it is the aesthetic air we breathe. This, however, sounds like the 
stalker’s claim to his victim that she ought to be interested in him be- 
cause he phones so often. This sort of justificatory argument cannot 
be right, because at least part of what it means to say that mass art 
(like the stalker) is everywhere is that we, even we philosophers, al- 
ready respond to it on its own terms just by opening our eyes or ears. 
Carroll’s ‘ought’, which proposes that we assume also a theoretical 
interest, has a hard time gaining a purchase just because of the expe- 
rience over which it seeks to exercise command. 

This problem can be brought out vividly by considering Carroll’s 
definition of mass art: 


X is a mass artwork if and only if 1. x is a multiple instance or type art- 
work, 2. produced and distributed by mass technology, 3. which artwork 
is intentionally designed to gravitate in its structural choices (for exam- 
ple, its narrative forms, symbolism, intended affect, and even its con 
tent) toward those choices that promise accessibility with minimum 
effort, virtually on first.contact, for the largest number of untutored (or 
relatively untutored) audiences (196). 


The pivotal, controversial claim here is that mass art offers its mean- 
ings without requiring much in the way of education and thus is, ten- 
dentially, art for everyone. Indeed, Carroll even argues that mass art 
gravitates toward pictorial media because our responses to pictures, 
unlike to our native languages, are biologically hardwired, requiring 
no education, strictly speaking, at all (192). But if the defining inten- 
tion of mass artworks is to render reflection on their reception un- 
necessary, does it not follow that they seek to render philosophy 
unnecessary? Even more strongly: if mass art fulfills its “promise” in 
virtue of rendering idle the kinds of critical and philosophical reflec- 
tion demanded by complex artworks, would having a philosophy of 
them not get in their way? 

As Carroll well knows, getting in the way of mass art has been the 
typical philosophical response to it. His opening chapter discusses 
the views of Dwight MacDonald, Clement Greenberg, R. G. Colling- 
wood, and T. W. Adorno, only to find them all wanting in consis- 
tency, cogency, or both. For Carroll, what joins these figures 
together so that they may be dispatched as a group is that, despite 
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the widely various sources of their supposed failings, they all offer 
“philosophical resistances to mass art.” Carroll’s grouping thus cop- 
flates two separate points, however, for it suggests that MacDonald et 
alia fail both because their individual arguments are bad and be- 
cause they try to resist mass art’s promise of easy accessibility. That it 
is the second of these failures that constitutes Carroll’s significant 
charge against this group is signaled by his identifying a collective 
“inability to come to terms with mass art” over and above the individ- 
ually inadequate arguments. Resisting mass art, though, if it is a sec- 
ond “failure” in addition to the mounting of bad arguments against 
it, is an affective failure which might be better named “refusing to 
swallow.” Unless Carroll can show that gagging is an improper re- 
sponse, however, his advice to take mass art seriously amounts to the 
paradoxical encouragement to swallow what is, by his own definition, 
designed to be easy to swallow. 

In order not to presume that the gag reflex is a philosophical er- 
ror, Carroll argues that resistance to mass art is a consequence of a 
deep and widely shared philosophical mistake. Specifically, he avers 
that a misplaced Kantianism is common to all the resisters and that 
this theoretical idol makes them unable to come to terms with mass 
art. According to Carroll, Collingwood et alia use “fragments of the 
Kantian theory of free beauty to presuppose a theory of art proper” 
relative to which mass art is judged bad art or not art at all (105); 
this, he continues, is a two-fold mistake because (a) “Kant had no 
theory of art at all” (89) and (b) Kantian theories of art are “alien or 
obsolete frameworks” which are therefore “utterly insensitive to the 
fact of mass art” (108). Even leaving aside that these two reasons 
prima facie contradict one another (which may be only a rhetorical 
slip on Carroll’s part) and that poor Collingwood gets peculiarly 
squeezed into generic Kantianism, this argument still has little to 
recommend it as a genealogy of the resistance to mass art. What, for 
instance, is one to make of the claim that Kant had no theory of art? 
On the presumption that Carroll has not forgotten sections forty- 
three through fifty-three of the Critique of Judgment in which Kant de- 
velops a rich theory of art, perhaps he ought to be interpreted as 
saying overemphatically that Kant had a bad theory of art. And per- 
haps he did, but Carroll’s claim that he had none at all gets in the 
way of his saying why. Instead, we get the observation, delivered in 
the peremptory voice of the guardian of the dustbin of history, that 
Kantianism is obsolete and alien. This, however, reduces the argu- 
ment to circularity: the judgment that the theory is obsolete is sup- 
ported by the judgment that holders of it cannot come to terms with 
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mass art, while the judgment that what the holders are doing is not 
coming to terms is supported by the judgment that the theory is ob- 
solete. Still, Carroll uses this circularity as his basis for concluding 
that the resisters of mass art are insensitive to it. Whether Green- 
berg, Adorno, or Collingwood were insensitive is an opinion best 
evaluated by individual readers, but that each dogmatically set him- 
self against accepting art designed to be accessible on the basis of a 
misplaced, even while nonexistent, Kantian theory of art is an argu- 
ment that rests on premises and arguments that are unsound. 

Even if Carroll could demonstrate that the taste of the resisters was 
informed by wrong philosophical commitments, however, his claim 
that resistant taste amounts to insensitivity to mass art begs the ques- 
tion whether mass art really is easy to swallow or, put in Carroll’s 
terms, whether what is designed to be accessible is therefore really 
accessible. If it is, then resistance to it would be a species of nonre- 
sponsiveness, of dogmatic taste, regardless of whether such resis- 
tance is motivated by bad philosophy. There is at least one good 
reason, however, to doubt that mass art is simply accessible, which 
has as a consequence that there is at least one good reason to think 
resistance to it is the proper response. We can begin to bring this 
reason to light once we notice that Carroll’s claim that resistance is a 
perverse response presumes that an artwork (or any artifact) de- 
signed to be accessible without much or any specialized background 
knowledge and thus designed to be used without reference to such 
knowledge will, if its design is well executed, be familiar and easy to 
operate. This, though, is a nonsequitur. After I learned to program 
my video cassette recorder, along came VCR+ with its promise of 
easy operation; nonetheless, I was befuddled by it because its easy ac- 
ceas features were novel relative to my already developed videotaping 
skills. The new technology was inaccessible to me just because it was 
designed to be accessible, Similarly, a friend of mine who rarely 
reads airport novels found The Bridges of Madison County impenetra- 
ble. In the absence of characters with psychological depth or a 
theme worthy of literary exploration, she complained, there was no 
way into the book as an instance of any literary genre with which she 
was familiar. Granted that the absence of these features was designed 
for easy access to the story (which, of course, she understood just 
fine), what was written was still, relative to a specific set of reading 
skills, inaccessible and unfamiliar. 

The reason Carroll does not notice this problem is that he infers 
from the fact that certain skills of, say, reading, such as following a 
plot, are widely or even universally distributed, that a novel designed 
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to draw on those skills alone will be accessible (think of this as his Es- 
peranto problem). But this inference skips over the fact that in the 
absence of an industry intending to cash in on the naked universality 
of universally distributed skills, mass artworks would be utterly unfa- 
miliar and so hard even to recognize. What we swallow easily is not 
what is designed to be acceasible, but the familiar, and these two con- 
cepts do not mean the same thing. 

It is a second puzzle that A Philosophy of Mass Art elides this distinc- 
tion since Carroll has equipped himself with all the tools needed to 
make it. One of the first points he makes in defining mass art is to 
urge us not to confuse it with popular art; indeed, he plants his flag 
deeply on this matter by writing that “perhaps the most misleading 
way to label the phenomenon is to call it popular art” (185). If by 
popular art we mean the “art of the common people,” then, he ar- 
gues, popular art has always existed, whereas maass art, on the other 
hand, is specific to “modern industrial mass society.” I agree that this 
is an important distinction. If we recall, however, that by Carroll’s 
own definition mass art is also an art of the common people in the 
sense that it draws on universally distributed capacities, then we 
ought to infer that mass art is the popular art of modern industrial 
mass society. This conclusion, though, contradicts the very distinc- 
tion Carroll insists on. He tries to resolve this problem by specifying 
the audience for mass art as a mass audience that crosses “national, 
religious, political, ethnic, racial, and gender boundaries.” This, 
however, just is the distinction between the familiar and the de- 
signed-to-be-accessible, since an art that is consumable by its in- 
tended audiences only by means of the disavowal of all the vehicles 
of human identity would be, almost by definition, an art unfamiliar 
to most humans. Now, if Carroll imported this distinction between 
the familiar and the designed-to-be-universally-acceasible back into 
the fundamental distinction between popular and mass, then he 
could thematize mass art as a kind of production and distribution 
which aims to establish historically novel alternatives both to popular 
art and to the common people; this, in turn, would allow him to 
block the problematic conclusion that mass art is our popular art. 
Oddly, though, Carroll refuses this saving strategy by arguing instead 
that the difference between the audiences for the two is merely nu- 
merical: popular art is liked by “lots” of people, mass art by “lots and 
lots” of people (187). This is clearly a distinction without a differ- 
ence which effectively erases the distinction between mass art and 
popular art. Instead, Carroll could and should have crafted a real 
distinction by considering how the mass audience intended by mass 
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art is an audience that is conceptually and practically different in 
kind from the culturally differentiated audiences presupposed by 
popular—familiar—arts. 

Of course, it may turn out (or already have turned out) that this dis- 
tinction between mass art and popular art gets erased as a matter of 
historical fact, that what is designed to be accessible will become the 
most familiar. For that to happen, though, two other historical 
changes in the audience for art will have to happen also. First, adult 
human beings will have to undergo a process of infantilization 
whereby they develop a taste for what is designed to be readily under- 
stood. I call this infantilization because it returns us to our early aes 
thetic pleasures, such as sing-songy poems and stories of mighty 
heroes; through those simple pleasures we developed skills that would 
be used later in meeting more challenging experiences, but as mass 
art becomes familiar art we are uncannily confronted by our childish 
pleasures, recalling us to our more primitive skills, in the guise of 
adult entertainment. Second, human beings will have to develop a dis- 
taste for what will have been familiar art previously. When partisans of 
local cultures complain about the Americanization of global culture, 
they are complaining about mass art (the production of which is over- 
whelmingly American) driving out the familiar arts of diverse locales. 
Looked at from the nonmassified, that is, local and locale-specitying, 
context, that mass art is designed to be accessible means that it is de- 
signed to make otherwise familiar and popular forms inaccessible. 
Carroll rightly claims that mass art and avant-garde art are the alterna 
tive forms of artistic modernity, but this leads him to think that there- 
fore avant-garde art is the privileged antagonist of mass art. In fact, 
though, while from its own perspective the avant-garde may be locked 
in a contest with mass art, mass art is locked in a contest with the pop- 
ular arts which it must render unfamiliar if it is to turn its own design 
imperative of accessibility into familiarity. Mass art, in short, is not ac- 
cessible in virtue of being designed to be; rather, it becomes accessible 
by driving out the ability to respond to art that demands reflection. It 
is this historical dynamic to which the philosophical resistances to 
mass art respond in their setting philosophy hopefully against mass art 
(and if aesthetics is not up to the job, so much the worse for it). 

If I am right here, then my earlier reference to the stalker was 
more than an analogy. Mass art manifests the same logic as the 
stalker: they are both designers of the accessible which insist that 
they are thereby our familiars. But just as it would be an addled 
piece of advice to a stalker’s victim that she should “come to terms” 
with the stalker, so, too, it is strange in the case of mass art. In the 
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first case, we would advise the victim to figure out effective ways to 
keep the stalker away—to keep him unfamiliar despite his design im- 
peratives—for the sake of a life lived better in virtue of not being ab- 
sorbed, theoretically or otherwise, in false and damaging offers of 
intimacy. So, too, for systematic aesthetics, the resistance to mass art, 
not despite its design imperatives but because of them, is the proper 
response in the sense that it is the proper way not to “come to terms” 
with it. Its insinuating familiarity, calculated, as Carroll argues, to 
render reflection unfamiliar, earns it our odium. 
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OF CONSPIRACY THEORIES* oa 2y 


“The only thought which philosophy brings with it, in regard to history, ik ~~ i 
the simple thought of Reason—the thought that Reason rules the world, 
and that world history has therefore been rational in its course.” 


—G.W.F. Hegel, The Philosophy of History 


“Shit happens.” —-Popular contemporary bumper-sticker slogan 


he millennium is nigh, and with each passing year, the Ameri- 
can consciousness is increasingly in the grip of conspiratorial 
thinking.’ Some conspiracy theories are the stuff of legend. 
Every year best-selling books are published, block-buster movies pro- 
duced, and high-rated television and radio programs aired which 
seek to convince us that Lee Harvey Oswald did not act alone in the 
assassination of John F. Kennedy; that, in 1947, an alien spacecraft 
crashed near Roswell, New Mexico, and the United States govern- 


* I wish to thank my philosophical colleagues at Washington University/St. 
Louis, where this paper was originally presented. Special thanks to David Hilditch, 
Pim Haselager, Pete Mandik, Jim Moore, and Chase Wrenn for insightful com- 
ments and discussion. 

! Conspiracy theory has not been given much attention by philosophers. In fact, 
I am aware of only a handful of discussions: for example, Karl Popper, The Open So- 
cisty and Its Enemies, Volume 2° The High Tide of Prophecy. Hegel, Marx, and the After 
math (London: Routledge, 1966, 5th ed.}, pp. 9499; and Charles Pigden, “P 
Revisited, or What Is Wrong with Conspiracy Theories?” Philosophy of the Social Sc 
ences, XXV (1993): 3-34. I believe that the reason for this omisuon is that most acad- 
emics simply find the conspiracy theones of popular culture to be silly and without 
merit. I believe, however, that it ıs incumbent on philosophers to provide analysis 
of the errors involved in common delusions, if that is indeed what they are. I offer 
this paper in the spirit of Philip Kitcher’s work on the philosophical difficulties of 
scientific creatlonism—Abusing Samce: The Case agarnst Creationism (Cambridge: 
MIT, 1982). 
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ment recovered the craft and its extraterrestrial occupants, then cov- 
ered up with stories of crashed weather balloons; that the rapid in- 
troduction of crack cocaine into America’s urban centers during the 
late 1980s was facilitated by Central Intelligence Agency-backed, 
Nicaraguan Contra-affiliated, drug smugglers. This plot benefitted 
both the CIA (by channelling money to the Contras after Colonel 
Oliver North’s Iranian arms deals had been uncovered) and their 
status quo seeking supporters (by keeping the economic and cub 
tural boot firmly on the neck of Black America). 

Here is a test of your conspiracy literacy. How many of the follow- 
ing conspiracy theories have you heard about before? How many are 
you tempted to believe have merit? 


(1) HIV (human immunodeficiency virus), the virus that causes AIDS 
(acquired immune deficiency syndrome), was the product of Ameri- 
can or Soviet biological warfare research before it was released (et 
ther intentionally or otherwise) on an unsuspecting world. 

(2) Extraterrestrials regularly visit our planet, mutilating cattle and ab- 
ducting humans (whose memories are then erased). Our govern 
ment is aware of this situation. 

(3) Los Angeles once had an efficient mass transit system based on 
street-cars, but in the 1930s and 1940s automakers, rubber manufac- 
turers, and oil companies colluded with city officials there to dis- 
mantle this system in order that Los Angeles could become a model 
city of automobile-based transportation. 

(4) Trans World Airlines Flight 800 was accidentally downed by a U. S. 
Navy missile; a fact then covered up by a government fearful of bad 
press in an era of post-Cold War military downsizing. 

(5) All transatlantic communications are monitored and recorded by 
the U. S. National Security Agency. 

(6) Significant aspects of the world economy are under the control of a 
small group of individuals, be they Freemasons, the Trilateral Com- 
mission, or a secret organization of Jewish bankers. 


I could go on. The point of this list is to make clear just how perva- 
sive conspiracy theory now is. Prognostication is a dangerous busi- 
ness, but I predict that future cultural critics and sociologists are 
going to have as much to say about our contemporary obsession with 
conspiracies as they now have to say concerning the implications of 
the 1950s American fascination with Unidentified Flying Objects and 
extraterrestrial invasions. 

I 
The present essay is epistemological, however, not sociological. I 
contend that the study of conspiracy theories can shed light on the 
nature of theoretical explanation. Conspiracy theories, as a general 
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category, are not necessarily wrong. In fact, as the cases of Watergate 
and the Iran-Contra affair illustrate, small groups of powerful indi 
viduals do occasionally seek to affect the course of history, and with 
some nontrivial degree of success. Moreover, the available, compet- 
ing explanations—both official and otherwise—occasionally repre- 
sent dueling conspiracy theories, as we shall see in the case of the 
Oklahoma City bombing. 

The definition of conspiracy theory poses unexpected difficulties. 
There seems to exist a strong, common intuition that it is possible to 
delineate a set of explanations—et us call them unwarrented conspiracy 
theories (UCTs).? It is thought that this class of explanation can be dis 
tinguished analytically from those theories which deserve our assent. 
The idea is that we can do with conspiracy theories what David Hume’ 
did with miracles: show that there is a class of explanations to which 
we should not assent, by definition. One clear moral of the present essay 
is that this task is not as simple as we might have heretofore imagined. 

Before continuing, I should emphasize that at no point should the 
reader conclude that I am giving arguments for or against the truth 
of any given explanation. The issue here is not whether aliens are in- 
deed visiting our planet, or whether Oswald acted alone. Ultimately, 
in these cases, there is a historical fact of the matter. These facts are 
not manifest, however, and we must theorize and speculate as to 
what has happened. The issue here is one of warranted belief. In 
other words, it may well be correct that “the truth is out there,” but 
given our epistemic situation, we ought not necessarily believe every- 
thing which is, in fact, true. In this respect, we are in the same situa 
tion as Hume. As Thomas Huxley‘ observed, Hume cannot say that 
miracles have never happened, only that, even if they have, we have 
no warrant to believe them. Hume has no way of determining, with 
certainty, whether Jesus turned stone into bread and fed the mult- 
tude. Maybe He did or maybe he did not. Hume is in a position, how- 
ever, to say whether we ought to believe this miracle occurred given the 
evidence at hand (or even given the possible evidence at hand). 

Understanding why we are not warranted in believing certain con- 
spiracy theories can make clearer why we ought to believe the things 


t Other, less charitable, readers have suggested such labels as ‘kooky’, ‘weirdo’, 
and ‘harebrained’, among others. While colorful, I do not find such terminology 
conducive to serious discussion 


> “Of Miracles” (section X of Enquires concerning Human Understanding, 1748), 
EE SENA ed., Hume on Mrracies (Bristol: Thoemmes, 1996), pp. 1- 


a “The Order of Nature: Miracles” (chapter VII of Huxley's Hume, 1881), 
repnnted in Tweyman, pp. 161-68. 
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that we should. I propose to make a study of UCTs, and in determining 
where they go wrong, attempt to tell a story about explanation properly 
conducted. In the following section, I shall turn to a recent event that 
has spawned a significant number of conspiracy theories: the Okla- 
homa City bombing. ‘This discussion of a real event and the conspiracy 
theories (some warranted, some not) will help me illustrate the analysis 
provided in the rest of the paper. In section m, I shall discuss the prob- 
lem of trying to define UCTs and illustrate the difficulties for finding 
analytic criteria for distinguishing good from bad conspiracy theories. I 
turn to the alleged virtues of UCTs in section IV in order to explain 
both their current popularity as well as the grounds we have for ulti 
mately rejecting them. Such explanations feature significant degrees of 
explanatory breadth and are not simply unfalsifiable in nature. But 
continued belief in a UCT requires more and more pervasive skeptt 
cism in people and public institutions. This leads me in section V toa 
discussion of conspiratorial thinking in the context of competing vi 
sions of the nature of the world. I suggest that confronting UCTs forces 
us to choose between the pervasive skepticism entailed by these theo- 
ries and an absurdist view of the world entailed by their rejection. I con- 
clude by suggesting that it is philosophy’s job to show us the way out of 
this dilemma. 
n 

To give us a concrete example of conspiracy theory in action, it is 
necessary to deal with one in some detail. The detail is relevant be- 
cause it is from the details of such events that UCTs take their start. I 
hope the reader will excuse this short diversion. 

At a few minutes past nine on the morning of Wednesday, April 
19, 1995, a Ryder rental truck—parked in front of the nine-story AF 
fred P. Murrah Federal Building in OKlahoma City, Odahoma—is 
torn apart by a powerful explosion. Some 168 people, including— 
most tragically—nineteen children in the building’s day-care center, 
are killed in what has since been designated the single worst act of 
terrorism ever carried out on American soil. 

As emergency personnel move in to rescue survivors from the rub- 
ble, a huge federal investigation is launched. Initial suspicion falls 
upon possible Middle-Eastern terrorists. This turns out to be a dead- 
end.’ After tracing the destroyed truck to Elliott’s Body Shop in Junc- 
tion City, Kansas, attention is soon focused on two white males in 
their twenties. Composite sketches of “John Doe #1” and “John Doe 


t Although not before Oklahoma resident and American citizen, Abrahim Ab- 
mad, is detained in London, strip-searched, and returned to the United States in 
shackles. He 1s quickly ruled out as a suspect and released. 
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#2” are released to the public on the following day. A nationwide 
manhunt ensues. 

Meanwhile, it happens that at 10:20 am on the morning of the 
bombing, Officer Charles Hanger of the Oklahoma State Troopers 
spotted a northbound 1977 Mercury Marquis on Interstate 35 ap- 
proximately sixty miles north of Oklahoma City. The car was trave} 
ling at eighty miles per hour and lacked visible license plates. Officer 
Hanger pulled the car over without incident, but as he approached 
the lone driver, he noticed a suspicious bulge under the driver’s 
jacket. The driver volunteered that he was indeed armed, at which 
point the officer drew his own weapon, pointed it at the head of the 
driver, and disarmed him of a loaded pistol and a sheathed fiveinch 
bladed knife. The driver, twenty-six-year-old Persian Gulf War vet- 
eran Timothy McVeigh, was arrested on charges of transporting a 
loaded weapon, carrying a concealed weapon, and operating a mo- 
tor vehicle without license plates. McVeigh was still in a cell awaiting 
arraignment two days later, when he was identified as “John Doe #1” 
and linked to the Oklahoma City bombing. 

The investigation continued and, some two years later, McVeigh, 
along with Terry Nichols and Michael Fortier, were convicted in con- 
nection with the case. McVeigh and Nichols were tried on a charge 
of “Use of a Weapon of Mass Destruction,” and several other counts, 
including eleven counts of First-degree Murder for the eleven fed- 
eral agents killed in the blast. (The government contends that the 
defendants intentionally set out to kill federal agents.) Fortier plead 
guilty to lesser crimes in relation to the bombing in return for his 
testimony against McVeigh and Nichols. Nichols was convicted on 
manslaughter and conspiracy charges, but the jury refused to convict 
him of the more serious crime of murder. McVeigh was convicted on 
all counts and sentenced to death. 

In its cases against McVeigh and Nichols, the government con 
tended that McVeigh, Nichols, and Fortier planned and carried 
out the bombing of the Oklahoma City federal building as an act 
of terrorism against what they saw as a powerful and dangerous 
governmental entity. (The government claims that McVeigh and 
his co-conspirators were inspired in part by A. MacDonald’s The 
Turner Diaries, a novel which describes a racial revolution in 
America and which begins with a small group bombing the Fed- 
eral Bureau of Investigation headquarters in Arlington, Virginia.) 
McVeigh, so goes the story, was particularly upset by the deadly 


* (Washington, D.C: National Alliance, 1978; New York: Barricade, 1996, 2nd 
editon)—latter cited here 
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federal raid of the Branch Davidian compound in Waco, Texas, 
and sought to avenge the deaths of these victims of federal heavy- 
handedness. (The Oklahoma City bombing occurred on the sec- 
ond anniversary of the fiery end of the Waco stand-off.) So, 
McVeigh (trained in explosives by the U. S. Army) and Nichols 
constructed a powerful ammonium nitrate and fuel oil bomb in 
the back of a rented truck and detonated it in front of the Okla 
homa City federal building. 

The bombing provides us with a nice example of the dynamics of 
conspiracy theory. Within days of the event, questions were raised 
about the official account of the bombing (itself calling for a con 
spiracy): Did McVeigh, Nichols, and Fortier act on their own, or 
were they a small part of a larger team? Were they set up as “patsies” 
to take the fall for the crime, as Oswald claimed he was before Jack 
Ruby gunned him down? Was the bombing perhaps carried out e+ 
ther directly or indirectly by elements within the U. S. government 
who wished to sway public sentiment toward greater lawenforce- 
ment powers and against “far right-wing” ideologues who have been 
increasingly setting themselves at odds with federal authorities? 
Many such questions have been raised in the years following the 
bombing. : 

The grounds for such conspiracy theories are investigated in a 
book, OKBomb! Consptracy and Cover-up, by Jim Keith (perhaps best 
known for his 1995 Black Helicopters over America: Strikeforce for the New 
World Order).’ Keith does not endorse any particular alternative ac- 
count, but instead raises many issues and questions for the official 
account. For example, he casts doubt upon the alleged guilt of the 
government’s prime suspect by listing eleven, “striking incongruities 
in the behavior of Timothy McVeigh prior to and during his arrest,” 
including the following: 


(1) “Of high curiosity is the fact that McVeigh, even though reported to 
possess fake I.D. under [other names], chose to give his correct 
name and address to the owner of the Dreamland Motel he was stay- 
ing at in Junction City [just prior to the bombing]. This is hardly 
the behavior of a man planning on committing a crime of the mag- 
nitude of the Oklahoma City bombing.”* 

(2) We are to believe that McVeigh was fleeing the scene of the crime in 
a car without license plates: “All the car seemingly lacked was a big 
sign saying ‘Arrest mel’” (whid, p. 23). 


’ Lilburn, GA: HlumiNet, 1996 and 1995, respectrvely. 
* OKBomb, pp. 28-29. 
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(3) “If McVeigh had just taken part in the bombing of the Murrah 
Building [...], is it likely that he, armed with a pistol, would have let 
a police officer approach his car without resisting him? As a man 
tramed in the use of small arms, McVeigh would not have had much 
trouble in gunning the officer down from within the protection of 
the car” (thid., p. 30). 

(4) When McVeigh was arrested, he gave as his address the property of 
James Nichols, the brother of Terry Nichols, allegedly the place 
where the bomb-making materials were stored: “Does it make any 
sense that McVeigh would have pointed the FBI to his alleged part- 
ner or partners in crime...” (thd, p. 31)? 


Keith adds to the above observations the following: there were 
early news reports suggesting that the Bureau of Alcohol, Tobacco, 
and Firearms (BATF) had received prior warning of the bombing, a 
claim supported by the fact that no BATF employees were in the 
building at the time of the blast, but were on the scene within min- 
utes. The BATF vociferously deny that they were forewarned, but no 
explanation has ever been given of the early reports. And there is 
the mysterious “John Doe #2”—never caught and left unexplained 
by the official story that led to the convictions of McVeigh and 
Nichols. 

The answer to these questions and others raised in Keith’s book 
are intended to lead the reader to the conclusion that all is not as we 
have been led to believe. Probing beneath the surface of the facts of 
the case as they have been presented in the mainstream media re- 
veals the real possibility that the Oklahoma City bombing may have 
not been the act of a lone group of politicallymotivated criminals, 
but rather suggests something much bigger and much more sinister. 
One popular conspiracy theory related to the bombing goes like 
this: a group of right-wing ideologues (including McVeigh, Nichols, 
Fortier, and the mysterious John Doe #2) were indeed plotting to 
blow up a federal building. Their actions were being monitored by 
the BATF, however. (On some accounts, their actions were being in- 
fluenced by the BATF. John Doe #2 was actually a BATF informer, or 
perhaps even a plant. The BATF hoped that swooping in and sting- 
ing a group of “dangerous, right-wing terrorists” at the very last mo- 
ment would do much to erase their public image as an organization 
of bumbling incompetents resulting from the fiasco in Waco.) In any 
case, McVeigh and his friends were involved, but only tangentially. 
McVeigh helps assemble the bomb, but he is unaware of the exact 
plans for its use, or is actively misled. At the last moment, the BATF 
screws up, loses contact with the group or are outsmarted by them 
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and the terrorists successfully carry out their act of terror. 
McVeigh—unaware that the bombing has occurred—is picked up by 
the police. The BATF realize that they have a public relations night- 
mare on their hands: they knew about the bombing, but through 
sheer incompetence and a desire to grandstand, failed to prevent it. 
When McVeigh is picked up in an unrelated incident, they see their 
chance to cover up their own involvement in and knowledge of the 
incident. He is the perfect patsy because he does have some involve- 
ment in the incident, but does not know the whole story. 

Notice that this account is not completely without plausibility. The 
BATF does have a public image of incompetence and institutional in- 
security. On most accounts, they fouled up the original raid on the 
Branch Davidian compound because they were more interested in 
national television coverage than in competent law enforcement. It 
makes sense to suggest that this is an image they are actively seeking 
to overcome. Also, this would not be the first time that a federal un- 
dercover agent has incited not-so-innocent citizens almost to carry 
out crimes they might not have without such encouragement. Fi- 
nally, it certainly would not be the first time that a lawenforcement 
agency has covered up the fact that it has incompetently allowed a 
crime to be committed. 

m 

What is a conspiracy theory? A conspiracy theory is a proposed expla- 
nation of some historical event (or events) in terms of the significant 
causal agency of a relatively small group of persons—the conspira- 
tors—acting in secret. Note a few things about this definition. First, a 
conspiracy theory deserves the appellation “theory,” because it prof- 
fers an explanation of the event in question. It proposes reasons why 
the event occurred. Second, a conspiracy theory need not propose 
that the conspirators are all powerful, only that they have played 
some pivotal role in bringing about the event. They can be seen as 
merely setting events in motion. Indeed, it is because the conspira- 
tors are not omnipotent that they must act in secret, for if they acted 
in public, others would move to obstruct them. Third, the group of 
conspirators must be small, although the upper bounds are necessar- 
ily vague. Technically speaking, a conspiracy of one is no conspiracy 
at all, but rather the actions of a lone agent. 

This then might be considered a bare-bones definition of conspir- 
acy theory, be it warranted or otherwise. UCTs—at least the ones 
that concern me in this essay—have a number of additional charac- 
teristics: 


(1) A UCT is an explanation that runs counter to some received, offi 
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Central to any UCT is an official story that the conspiracy theory 
must undermine and cast doubt upon. Furthermore, the presence of 
a “cover story” is often seen as the most damning piece of evidence 
for any given conspiracy; the fact that someone has gone to such 
pains to create a false explanation points to a conscious effort to de- 
ceive. 
(2) The true intentions behind the conspiracy are invariably nefarious. 


I am aware of no popular conspiracy theory according to which 


some group of powerful individuals is secretly doing good but desper- 
ately hopes its schemes will not be revealed. 


(3) UCTs typically seek to tie together seemingly unrelated events. 


You might think that the Oklahoma City bombing had nothing to do 
the 1995 Sarin gas attack in the Tokyo subway. What if I told you, 
however, that the appropriate anniversary was not the federal raid in 
Waco (which occurred two years earlier to the day) but rather the 
next day, one month to the day since the gas attack in Tokyo? The 
connection is that the attack in Tokyo was carried out by the CIA in 
retaliation for the Japanese bugging of the Clinton White House, all 
part of an ever-escalating Japanese-American trade war. So the Okla- 
homa City bombing was a “payback” hit carried out by the Japan- 
ese...” As will be discussed below, this unifying aspect of conspiracy 
theories is a key feature of their apparent explanatory strength. 


(4) As noted, the truths behind events explained by conspiracy theories 
are typically well-guarded secrets, even if the ultimate perpetrators 
are sometimes wel-known public figures. 


What seems to drive many conspiracy theorists is the deeply held be- 
lief that, if only the truth can be publicly revealed (in much the way 
that Bob Woodward, Carl Bernstein, and “Deep Throat” exposed the 
Watergate break-ins), the conspirators’ nefarious plans will be 
thwarted. Therefore, it is in the interest of conspirators to see to it 
that the truth is not revealed, or if it is, that it is not widely believed. 


(5) The chief tool of the conspiracy theorist is what I shall call errant 
data. 


* Let me note here that J am not making this stuff up. Throughout this paper, the 
examples I offer all come from sources other than myself. For example, see Keith’s 
OKBom5!, chapter 21, for more on the “Japanese connection” to the Oklahoma 


Gity bombing. 
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Errant data come in two classes: (a) unaccounted-for data and (b) 
contradictory data. Unaccounted-for data do not contradict the re- 
ceived account, but are data that fall through the net of the received 
explanation. They are data that go unexplained by the received ac- 
count. For example, the early reports that the BATF had prior warn- 
ing of the Oklahoma City bombing and the fact that no BATF 
employees were in the building at the time of the explosion repre- 
sent unaccounted-for data with respect to the received account of 
the bombing. Contradidory data are data that, if true, would contra- 
dict the received account. McVeigh’s manifest idiocy in fleeing the 
scene of the bombing in a car without license plates is a contradic- 
tory datum with respect to the official account of him as conspirator- 
ial ringleader capable of planning and carrying out such a terrorist 
operation. (The role of errant data will be discussed in more detail 
below.) 

These criteria go some way toward distinguishing UCTs from com 
spiratorial explanations that are less epistemically problematic. I am 
not concerned here with conspiracies to throw surprise birthday par- 
ties for friends or attempts by parents to deceive young children 
about the existence of magical beings.” Such day-to-day conspiracies 
typically do not meet all of these additional criteria. Surprise birthday 
parties are not organized for nefarious purposes, nor are they meant 
to be kept secret perpetually. These criteria, however, do not distin 
guish UCT from all conspiracies we are warranted in believing. Both 
Watergate and the Iran-Contra Affair meet all of these criteria, yet be- 
lief in these conspiracies seem prima facie warranted." There is no 
criterion or set of criteria that provide a priori grounds for distin- 
guishing warranted conspiracy theories from UCTs. The philosophi 
cal difficulties of UCTs require a deeper analysis, to which I now turn. 

IV 


Conspiracy theories are attractive, a fact demonstrated by their cur- 
rent popularity. But their alleged virtues are subtly flawed. I shall 


“I cannot help but wonder whether conspiracy's grip on Western thought can 
be blamed, at least in part, on the number of conspiracies we experience as chil 
dren. As we grow older, we discover just how many things adults have been system- 
atically lying to us about Santa Claus, the Faster Bunny, the Tooth Fairy, where 
babies come from, and so on 

! Some might balk at the “nefariousness” criterion. Presumably, North and his 
colleagues believed they were serving some higher good by their attempt to ar- 
cumvent Congress's prohibitons on funding Nicaraguan rebels. I do not wish to 
get into a debate over the semantics of ‘nefariousness’ in the Machiavellian context 
of politics. Suffice it to say that however one characterizes what went on the Iram 
Contra Affair and Watergate, they were not completely above board. If they were, 
secrecy would not have been so necessary. 
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show how certain elements of conspiracy theory appear to be those of 
legitimate theoretical explanation, but argue that on closer analysis, 
they are not. Therefore, I am trying to do two things: (1) explain 
why it is that UCT’s are so popular, but (2) explain why we should 
nonetheless fail to believe in them. They are popular, I suggest, be- 
cause they exhibit several well-known explanatory virtues. They are 
nonetheless unwarranted because they exhibit these virtues in ways 
that undermine the strength of those virtues. 

The first and foremost virtue which conspiracy theories exhibit, 
and which accounts for much of their apparent strength, is the 
virtue of unified explanation or explanatory reach. According to this 
virtue, all things being equal, the better theory is the one that pro- 
vides a unified explanation of more phenomena than competing ex- 
planations. Unified explanation is the sine qua non of conspiracy 
theories. Conspiracy theories always explain more than competing 
theories, because by invoking a conspiracy, they can explain both the 
data of the received account and the errant data that the received 
theory fails to explain. So, for example, in our case of the Oklahoma 
City bombing, conspiracy theories explain the data of the official 
story. On the account discussed above, the bombing was carried out 
by terrorists of which the BATF and other agencies were aware, but, 
due to sheer incompetence, failed to stop. This theory explains why 
and how the Murrah building came to be bombed. Furthermore, it 
explains the various errant data: for example, why no BATF person- 
nel were in the building (they were forewarned) and the bizarre be- 
havior of McVeigh (who was innocent of the crime and hence not 
expecting to be framed by federal agents in search of a scapegoat). 

This is the beauty of conspiracy theories. They offer wonderfully 
unified accounts of all the data at hand, both those the official story 
explains, plus those niggling, overlooked errant data. As J shall now 
try to show, however, UCTs obtain unity of explanation at too high a 
cost. 

The role of errant data in UCT is critical. The typical logic of a 
UCT goes something like this: begin with errant facts, such as the ob- 
servation that no BATF employees were injured in the Oklahoma 
City bombing and the early reports of prior warning. The official 
story all but ignores this data. What can explain the intransigence of 
the official story tellers in the face of this and other contravening evi 
dence? Could they be so stupid and blind? Of course not; they must 
be intentionally ignoring it. The best explanation is some kind of 
conspiracy, an intentional attempt to hide the truth of the matter 
from the public eye. 


120 THE JOURNAL OF PHILOSOPHY 


By invoking a conspiracy hypothesis, large amounts of “evidence” 
are thrown into question. This is one of the most curious features of 
these theories: to my knowledge, conspiracy theories are the only 
theories for which evidence against them is actually construed as evi- 
dence tm favor of them. The more evidence piled up by the author+ 
ties in favor of a given theory, the more the conspiracy theorist 
points to how badly “They” must want us to believe the official story. 

Let me note two things at this point. First, conspiracy theories are 
not alone in placing great emphasis on errant data. The history of 
science is replete with examples of theoretical innovation initiated 
by an investigation into data that did not fit the standard paradigm. 
It is a good pragmatic heuristic for scientific effort to be expended 
on chasing after errant data, in the hopes that these loose strings 
might lead to the unraveling of currently misguided theory. What 
conspiracy theories get wrong, however, is that the existence of er- 
rant data alone is not a significant problem with a theory. Given the 
imperfect nature of our human understanding of the world, we 
should expect that even the best possible theory would not explain all 
the available data. One’s theory should not fit all the available data, 
because not all the available data are, in fact, true.” Invariably, some 
of our measurements, some of our interpretations and other theo- 
ries get something wrong about the nature of the world. 

Second, the problematic of conspiracy theories goes beyond sim- 
ple false data. If the only problem with UCTs was that they place too 
much emphasis on small sets of data at odds with an official account, 
then that would not make them a very interesting phenomenon. 
Conspiracy theories differ from most other theories in one very in- 
teresting way, however. Conspiracy theorists would rightly point out 
that they have one problem with which scientists are not faced. By 
hypothesis, the conspiracy theorist is struggling to explain phenom 
ena that other, presumably powerful, agents are actively seeking to 
keep secret. Unlike the case of science, where nature is construed as 
a passive and uninterested party with respect to human-knowledge 
gathering activities, the conspiracy theorist is working in a domain 
where the investigated actively seeks to hamper the investigation. Imagine if 
neutrinos were not simply hard to detect, but actively sought to avoid 
detection! This is exactly the case with which conspiracy theorists 
contend we are confronted in the cases they seek to explain. This is 
why countervailing evidence and lack of evidence can and ought to 
be construed as supporting their theories. 


"T have heard this sentiment attributed to Francis Crick. 
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This brings me to the most commonly voiced complaint about 
UCTs, namely, that they are simply unfalsifiable. The worry is that 
given a situation where all potentially falsifying evidence can be con- 
strued as supporting, or at worst as neutral evidence, then conspiracy 
theories are by definition unfalsifiable. In favor of conspiracy theo- 
rists, it should be noted that this unfalsifiability is not as ad hoc as it 
might initially seem, due to the active nature of the investigated, just 
noted. It is not ad hoc to suppose that false and misleading data will 
be thrown your way when one supposes that there is somebody out 
there actively throwing that data at you. Just ask Kenneth Starr. As 
evidenced by any number of twentieth century, U. $. government- 
sponsored activities (take your pick), we have reason to believe that 
there exist forces with both motive and capacity to carry out effective 
disinformation campaigns. 

My claim here is that unfalsifiability is only a reasonable criterion 
in cases where we do not have reason to believe that there are power- 
ful agents seeking to steer our investigation away from the truth of 
the matter. Falsifiability is a perfectly fine criterion in the case of nat- 
ural science when the target of investigation is neutral with respect 
to our queries, but it seems much less appropriate in the case of the 
phenomena covered by conspiracy theories. Richard Nixon and 
North actively sought to divert investigations into their respective ac- 
tivities and both could call upon significant resources to maintain 
their conspiracies. They saw to it that investigators were thwarted in 
many of their early attempts to uncover what they accurately sus- 
pected was occurring. Strictly hewing to the dogma of falsifiability in 
these cases would have led to a rejection of conspiracy theories at 
too early a point in the investigations, and may have left the conspir- 
acies undiscovered. 

No, the problem with UCTs is not their unfalsifiability, but rather the 
increasing degree of skepticism required by such theories as positive ev- 
idence for the conspiracy fails to obtain. These theories throw into 
doubt the various institutions that have been set up to generate reliable 
data and evidence. In doing so, they reveal just how large a role trust— 
in both institutions and individuals—plays in the justification of our be- 
liefs. The problem is this: most of us—including those of us who are 
scientists and who work in scientific laboratories full of expensive 
equipment——have never carried out the experiments or made the em- 
pirical observations that support most contemporary scientific theories. 
Unless we want to conclude that the vast majority of us are not war- 
ranted in believing that the platypus is a mammal and that gold is an 
atomic element, we need some procedure by which the epistemic war- 
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rant obtained by those who do make the appropriate observations can 
be transferred to the rest of us. In modern science, this procedure in- 
volves the elaborate mechanisms of publication, peer review, profes- 
sional reputation, university accreditation, and so on. Thus, we are 
warranted in believing the claims of science because these claims are 
the result of a social mechanism of warranted belief production.” 

In the public sphere where conspiracy theories dwell, there are re- 
lated mechanisms for generating warranted beliefs. There is the free 
press, made up of reporters, editors, and owners who compete to pub- 
lish “the scoop” before others do. There are governmental agencies 
charged with investigating incidents, producing data, and publishing 
findings. And there are, of course, various “free agents” (including the 
conspiracy theorists themselves) who are members of the public. In- 
herent in the claim that alleged evidence against a theory should be 
construed as evidence for that theory is a pervasive skepticism about 
our public, fact-gathering institutions and the individuals working in 
them. Thus, as a conspiracy theory matures, attempt after attempt to 
falsify a conspiracy theory appears to succeed, and this apparent suc- 
cess must be explained as the nefarious work of the conspirators. As a 
result of this process, an initial claim that a small group of people is 
conspiring gives way to claims of larger and larger conspiracies. 

In the case of the Oklahoma City bombing, the initial conspiracy 
only involved the BATF agents and their immediate superiors. The 
FBI and other federal agencies were brought into the investigation, 
yet did not report a BATF conspiracy. Therefore, they must have 
been brought into the conspiracy. Ditto for certain members of the 
press who must have stumbled across evidence of the conspiracy, but 
who have not yet disclosed it in the national media. What began as a 
small conspiracy on the part of a few members of a paramilitary U. S. 
federal agency invariably swells into a conspiracy of huge propor- 
tions, as positive evidence for the alleged conspiracy fails to obtain. 
And, as more people must be brought into the conspiracy to explain 
the complicity of more and more public institutions, the less believ- 
able the theory should become. 


3 For more on the critical role of testumony in the epistemic process, see C. A. J. 
Coady, Testimony. A Philosophical Study (New York: Oxford, 1992). For more on as- 
eee on the soctal construction of warranted belief, see Helen Longino’s Scence as 

: Values and Objectruity m Sasntfic I (Princeton: University 
1990), and “The Fate of Knowledge in Socal Theories of Science,” m Fred- 
enck F Schmitt, ed., Sodakung Epistemology: The Social Dimensions of Knowledge (Lan- 
ham, MD: Rowan and Littlefield, 1994), pp. 185-57; and Kitcher’s “Socialinng 
Knowledge,” this JOURNAL, Lxxxvm, 11 (November 1991): 675-76, and “Contrasting 
Conceptions of Social Epistemology,” in Schmitt, pp. 111-34. 


OF CONSPIRACY THEORIES 123 


It is this pervasive skepticism of people and public institutions en- 
tailed by some mature conspiracy theories which ultimately provides 
us with the grounds with which to identify them as unwarranted. It is 
not their lack of falsifiability per se, but the increasing amount of 
skepticism required to maintain faith in a conspiracy theory as time 
passes and the conspiracy is not uncovered in a convincing fashion. 
As this skepticism grows to include more and more people and inst 
tutions, the less plausible any conspiracy becomes. 

Consider another famous UCT, the one claiming that the Holocaust 
never occurred and is a fabrication of Jews and their sympathizers. 
Robert Anton Wilson correctly notes that “a conspiracy that can deceive 
us about 6,000,000 deaths can deceive us about anything, and that it 
takes a great leap of faith for Holocaust Revisionists to believe World 
War U happened at all, or that Franklin Roosevelt did serve as President 
from 1933 to 1945, or that Marilyn Monroe was more ‘real’ than King 
Kong or Donald Duck.”* In the process of holding onto a belief in an 
increasingly massive conspiracy behind more and more public events, 
we undermine the grounds for believing in anything. At some point, we 
shall be forced to recognize the unwarranted nature of the conspiracy 
theory if we are to left with any warranted explanations and beliefs at all. 

V 


I want to take a step back and get a handle on the broader philo- 
sophical issues raised by conspiracy theories and the implication of 
their current surge in popularity. I contend that conspiracy theories 
embody a thoroughly outdated world view, a perspective on the 
meaning of life that was more appropriate in the last century. Recog- 
nizing this anachronistic element of conspiratorial thought is useful, 
however, if it reveals something about the contemporary Zeitgeist. 
Furthermore, the present popularity of conspiracy theories suggests 
that we are now in the grip of a conflict between world views. 
Conspiracy theorists are, I submit, some of the last believers in an 
ordered universe. By supposing that current events are under the 
control of nefarious agents, conspiracy theories entail that such 
events are capable of being controlled. In an earlier time, it would have 
been natural to believe in an ordered world, in which God and other 
supernatural agents exercised significant influence and control. With 
the rise of materialist science and capitalist economies—peaking in 
the eighteenth and nineteenth centuries—the notion of an ordered 
universe was still held to, but the role of the supernatural was either 


H “Beyond True and False: A Sneaky Quiz with a Subversive Commentary,” in 
Ted Schultz, ed., The Fringes of Reason (New York’ Harmony, 1989), pp. 170-73, 
here p 172. 
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greatly diminished (as in Deism) or eliminated (as in Marxism). As G. 
W. F. Hegel puts it in the passage quoted in the epigraph, “world his- 
tory has...been rational in its course.” Therefore, on this view, there is 
some hope that humans can understand, predict, and conceivably 
control the course of human events. This the conspiracy theorists be- 
lieve, only they further believe that the wrong folks are at the helm. 

Such beliefs are out of step with what we have generally come to be- 
lieve in the late twentieth century. The rejection of conspiratorial think- 
ing is not simply based on the belief that conspiracy theories are false as 
a matter of fact. The source of the problem goes much deeper. The 
world as we understand it today is made up of an extremely large num- 
ber of interacting agents, each with its own imperfect view of the world 
and its own set of goals. Such a system cannot be controlled because 
there are simply too many agents to be handled by any small controlling 
group. There are too many independent degrees of freedom. This is 
true of the economy, of the political electorate, and of the very social, 
fact-gathering institutions upon which conspiracy theorists cast doubt. 
Even if the BATF were part of a large conspiracy to cover up their in- 
competence in the Oklahoma City bombing, it is implausible to believe 
that not a single member of the BATF stationed in Oklahoma would be 
moved by guilt, self-interest, or some other motivation to reveal that 
agency's role in the tragedy, if not to the press, then to a lover or family 
member. Governmental agencies, even those as regulated and con- 
trolled as the military and intelligence agencies, are plagued with leaks 
and rumors. To propose that an explosive secret could be closeted for 
any length of time simply reveals a lack of understanding of the nature 
of modern bureaucracies. Like the world itself, they are made up of too 
many people with too many different agendas to be easily controlled. 

The rejection of the conspiratorial world view, however, is not 
something about which I am particularly thrilled. If conspiracy theo- 
ries are genuinely misguided, then I fear we are left with an appar- 
ently absurdist image of the world. A lone gunman can change the 
course of history when the U. S. President just happens to drive past 
the window of his place of work during the gunman’s lunch hour. 
The conspiratorial world view offers us the comfort of knowing that 
while tragic events occur, they at least occur for a reason, and that 
the greater the event, the greater and more significant the reason. 
Our contemporary world view, which the conspiracy theorist refuses 
to accept, is one in which nobody—not God, not us, not even some of 
us—is in control. Furthermore, the world (including the people in 
it) is uncontrollable, irrational, and absurd in a way illustrated by the 
plays of Eugene Ionesco and Samuel Beckett. 
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It is not that the rejection of conspiratorial thinking entails that 
the world is random, only that it is without broad meaning and signif- 
icance. One can reject conspiracies about Kennedy’s assassination 
and still hold that Oswald's behavior was caused and fully determin- 
istic in ways on which science can get a grip. Such a scientific under- 
standing of Oswald’s behavior is absurdist, however, in just the sense 
I intend here. There was no human or rational agency responsible 
for his behavior on such account, only the blind workings of brain 
chemicals and childhood emotional traumas. Events like Kennedy’s 
assassination and the Oklahoma City bombing have had an enor- 
mous emotional and meaningful impact on our world; indeed, they 
represent singularities after which American culture has never been 
the same. Conspiracy theories have as a virtue the attempt to pre- 
serve a human meaning—a rational accounting——for these sea 
changes which allows them to be understood in human terms. If the 
conspiracy theorists are wrong, then when considered from the hu- 
man perspective, all we seem to be able to say is that “Shit happens.” 

Considered in this light, the challenge of conspiracy theory is that 
it forces us to choose between an almost nihilistic degree of skepti- 


cism and absurdism: the conspiracy theorist chooses to embrace the ~ 


hyperskepticism inherent in supposing dissimulation on@1 truly mas- 
sive scale (by distrusting the claims of our-institutions) over the ab- 
surdism of an irrational and essentially meaningless world. Until a 
third option is presented—and perhaps this is one of the jobs of phi- 
losophy—we should expect UCTs to continue enjoy significant pop- 
ularity. 
VI 

What lessons should we take away from these musings on the dynam- 
ics of conspiracy theory? The story I have told here seems to point to 
three main, related morals. 

First, the folly of conspiracy theories highlight the fact that we 
should not be overly bothered when our theories of social events fail 
to make sense of all the data. Invariably, not all the data are true. In 
particular, the irrational and fallible nature of humans should lead us 
to expect that some of the data generated by us are certainly wrong. 
Witnesses misremember the past or exhibit unconscious biases. Re- 
porters and government agents will get things wrong in the early mo- 
ments of a crisis, and will later be loathe to admit those mistakes. For 
this reason, a theory that has as one of its main features a unified ac- 
count of all of the data in a variety of seemingly unrelated occurrences 
should be called into question on those grounds alone. We expect our 
explanations to be good, but we also expect them to be imperfect. 


—_ 
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Second, we should be careful not to overrationalize the world or 
the people that live in it. Rejecting conspiratorial thinking entails ac- 
cepting the meaningless nature of the human world. Just as with the 
physical world, where hurricanes, tornadoes, and other “acts of God” 
just happen, the same is true of the social world. Some people just do 
things. They assassinate world leaders, act on poorly thought out ide- 
ologies, and leave clues at the scene of the crime. Too strong a belief 
in the rationality of people in general, or of the world, will lead us to 
seck purposive explanations where none exists. 

The third lesson of conspiracy theories is that we ought to recog- 
nize such theories as embodying an almost nihilistic degree of skept 
cism about the behavior and motivations of other people and the 
social institutions they constitute. To the extent that a conspiracy 
theory relies on a global and far-reaching doubt of the motives and 
good will of others, it is akin to global philosophical skepticism. 
These extreme skeptical stances should be dealt with in the same 
way. We should be wary of theoretical accounts that expend more 
energy undermining the epistemic warrant of competing explana- 
tions than on generating new, positive evidence. 

So, in the end, what do I think of conspiracy theories? My initial mo- 
tivation was to present an analysis of conspiracy theories in the spirit of 
Hume’s analysis of miracles. For Hume, miracles are by definition expla 
nations that we are never warranted in believing. If my analysis here is 
correct, however, we cannot say the same thing about conspiracy theo- 
ries. They are not by definition unwarranted. (A good thing given that 
we want to believe in at least some conspiracies— for example, Water- 
gate and Iran-Contra.) Instead, I suggest that there is nothing straight- 
forwardly analytic that allows us to distinguish between good and bad 
conspiracy theories. We seem to be confronted with a spectrum of cases, 
ranging from the believable to the highly implausible. The best we can 
do is track the evaluation of given theories over tme and come to some 
consensus as to when belief in the theory entails more skepticism than 
we can stomach. Also, I suspect that much of the intuitive “problem” 
with conspiracy theories is a problem with the theorists themselves, and 
not a feature of the theories they produce. Perhaps the problem is a psy 
chological one of not recognizing when to stop searching for hidden 
causes. Nonetheless, I suggest that the study of conspiracy theories, even 
the crazy ones, is useful, if only because it forces us clearly to distinguish 
between our “good” explanations and their “bad” ones. 

BRIAN L. KEELEY 
Washington University/St. Louis 
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ne of the favorite argumentative methods of present-day an- 

alytic philosophers is to appeal to intuitions. I argue here 

that such appeals are usually without any respectable theo- 
retical foundation. Moreover, the best way of justifying the use of in- 
tuitions in philosophical argumentation entails radical changes in 
our ways of thinking about intuitions. 

Where does the current popularity of appeals to intuition come 
from? The timing of the great revival of intuitionist methodology 
gives us a clue to its causes. Before the early 1960s, you could 
scarcely find any overt references, let alone appeals, to intuitions in 
the pages of philosophical journals and books in the analytical tradi 
tion. After the mid-1960s, you will find intuitions playing a major 
role in the philosophical argumentation of virtually every article or 
book. Why the contrast? 

The answer is simple. Intuitions came into fashion in philosophy 
as a consequence of the popularity of Noam Chomsky’s linguistics 
and its methodology. According to a widespread conception, genera- 
tive linguists like Chomsky were accounting for competent speakers’ 
intuitions of grammaticality by devising a grammar, that is, a set of 
generative rules that produces all and only such strings that are inti 
itively accepted by these speakers. This kind of methodology was 
made attractive by the tremendous perceived success of Chomsky’s 
theories in the 1960s and 1970s. Not only was transformational gram- 
mar the dernier cn in linguistics, it was seen as a major revolution in 
the study of language. What is more, it was taken to provide a 
methodological paradigm of what can be done in those fields where 
the subject matter involves the tools of human thought and cogni- 
tion. The use of intuitions in philosophical argumentation thus orig- 
inated from philosophers’ attempt to get on the bandwagon of 
transformational grammar. 

This bandwagon effect was amplified by the example of some well- 
known philosophers. An influential case in point was Saul Kripke.! 
As early as 1963-1964, he was led by his “natural intuition” to the 
view “that the names of ordinary language are rigid designators” 
(id, p. 5). Furthermore, he acknowledges himself that his argu- 
mentation for this view rests on intuition: “In these lectures, I will ar- 
gue, intuitively, that proper names are rigid designators...” (sdd., p. 


| Nameng and Necessity (Cambridge: Harvard, 1980), p. 5. 
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49). Kripke also argues as if his main problem were the correct inter- 
pretation of these intuitions; for instance, whether they can be expli- 
cated so as to pertain to the relative scopes of different logically 
active notions (thid., p. 10), or whether Michael Dummett or Kripke 
himself has the right view on “linguistic intuitions.” Quite obviously, 
Kripke helped to give the intuitionist bandwagon a mighty push. Un- 
fortunately, his reliance on intuitions in defending his idea of rigid 
reference is apt to give intuitions a bad name. I shall return to his 
use of intuitions later. 

In reality, the linguistic parentage of contemporary philosophers’ 
intuitionist methodology nevertheless constitutes a strong reason to 
be wary of it. Ironically, the attribution of such a mode of argumen- 
tation to Chomsky is at best a halftruth. Speakers’ intuitions first en- 
tered into his thinking as an antidote to oversimplified views 
concerning the ways in which a grammar could, in principle, be con- 
structed for a previously unstudied language from a corpus of ob- 
served utterances. One of the goals of some of his linguistic 
predecessors had been what Chomsky called decision procedure, that is, 
a systematic method of choosing among different grammars, each of 
which purports to account for a given corpus of utterances. He ar- 
gues against the feasibility of such a method and, among other 
things, adduces intuitions as alternative grounds of comparisons 
among different grammars. According to John Lyons:? 


In Syntactic Structures* he [Chomsky] says that the sentences generated 
by the grammar should be “acceptable to the native speaker,” and he 
considers that it is a point in favor of the kind of grammar he develops 
that it also accounts for the “intuitions” of native speakers with respect 
to the way certain sentences are recognized as equivalent or ambiguous 
(op. cit, pp. 32-33). 


Indeed, the first explicit proposal of an intuition-based grammar is 
not found in Chomsky but in the famous (or notorious) review of 
Syntactic Structures by Robert B. Lees,* who elevates one’s “feeling for 
language”—or at least feeling for the syntax of language—to the sta 
tus of “the empirical basis of grammatical analysis”: 


..as we see from the practice of linguists, though unfortunately not 
from their own descriptions of linguistic methodology, the criterion ac- 
tually used in all crucial cases is either the informant’s response in care- 
fully designed...elicitation techniques, or else the linguist’s own 


t Noam Chomsky (New York: Penguin, 1977, revised edition). 
* Mouton: The ae 1957; see pp. 49-50. 
1 Language, XXX (1957): 375-407 
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Sprachgefihl is called upon to provide correct analysis, after which any 
ad-hoc rule may be devised to designate the results (shd., p. 309). 


Lees spells this out further in a footmote: 


It is precisely this Sprachgefuhl, this mfutue notion about linguistic 
structure, which together with the sentences of a language, forms the on- 
pirical basis of grammatical analysts and it is precisely the purpose of lin- 
guistic science to render explicit and rigorous what ever is vague about 
these linguistic feelings (ibid., p. 309, fn. 40; emphases added). 


Any linguist or any philosopher who is familiar with the subsequent 
argumentative practice in her field can only have here a feeling of 
déjà vu all over again. What is going on is obviously that Lees was try- 
ing to align Chomsky’s methodology with that of empirical sciences 
(such as Lees’s own first métier, chemistry), with intuitions playing 
roughly the same role as empirical observations. 

Chomsky’s willingness to countenance Lees’s construal of his 
methodology amounted to a significant change of mind on his part; 
for Chomsky’s original strategy did not involve appeals to intuition. 
In The Logical Structure of Linguistic Theory,’ he writes that a linguistic 

..theory is significant just to the extent that we can demonstrate that 

the data do have the prescribed form, that is, to the extent that we have 

unambiguous, cross-culturally valid tests for applying the undefined no- 
tions to data. Thus if one of the basic undefined terms of linguistic the- 

Ory is “intuition,” and if we define phonemes in this theory as elements 

which our intuition perceives in a language, than the notion of 

phoneme is as clear and precise as is “intuition.” ...It should be clear, 
then, why the linguist interested in constructing a general theory of lin- 
guistic structure, ın justifying given grammars or (to put the matter in 
its more usual form) in constructing procedures of analysis should try 

to avoid such notions as “intuition” (iid, pp. 86-87). 

Admittedly, Chomsky himself later moved closer to an intuitionist 
methodology. In his later work, according to Lyons, “Chomsky in- 
cludes intuitions of the speakers of a language as part of the data to 
be accounted for by the grammar” (op. cit, p. 33). Chomsky also 
quickly qualified the role of intuitions in the methodology of linguis- 
tics by introducing the distinction between competence and 
performance. But this distinction is highly questionable and, pre- 
dictably, it did nothing to cool philosophers’ ardor in their appeals 
to intuitions. For by the token of Chomsky’s distinction, to hesitate 
in one’s appeals to one’s intuitions would have been ipso facto to 
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doubt one’s own linguistic or, more generally, conceptual compe- 
tence. 

What is there to be said about philosophers’ toeing the Chom- 
skian party line? The most amazing fact about the current fashion of 
appealing to intuitions is the same as the proverbial dog’s walking on 
two feet: not that it is done particularly well but that it is done at all. 
For what is supposed to be the justification of such appeals to intu- 
ition? One searches the literature in vain for a serious attempt to 
provide such a justification. 

This blind faith is below the intellectual dignity of philosophers 
for whom unexamined intuitions should not be worth intuiting. It is 
also in stark contrast to the procedure of earlier philosophers. Every 
major philosopher or philosophical tradition whose argumentation 
relied on intuitions (or their equivalent) provided an explanation 
why, according to their lights, intuitions could provide the right kind 
of information or insight. 

For Aristotle, for instance, to think of X is to realize the form of X 
in one’s soul. This instantiation of the form is quite as authentic as 
an instantiation outside the soul. The instantiated form in the soul 
has the same powers and the same necessary links to other forms as 
an instantiation outside the soul. Hence I can find all about this 
form “intuitively” simply by realizing it in my mind. As one might put 
it, thought experiments for Aristotle were literally real experiments. 
Because of this, the immediate awareness of what there is in my soul, 
called by Aristotle notis, could give me the first premises of a science. 

It is hard to think of a better justification for appeals to intuition. 
Contemporary philosophers do not believe in the reality of forms, 
however, no matter whether they are supposed to be instantiated in 
the mind or in the world, nor have any earlier philosophers done so 
since the days of the medieval nominalists. Our contemporaries do 
not have the courage of their intuitions and claim infallibility for 
them, as Aristotle did. Their use of intuitions as prima facie evidence 
has nothing to do with Aristotle’s nous. If they want to find an Aris- 
totelian precedent to their methodology, the obvious candidate is 
not Aristotle’s reliance on noûs, but his use of reasonable common 
opinions, endoxa, as starting points of his dialectical arguments. The 
resemblance between them and twentieth-century philosophers’ “in- 
tuitions” is uncanny, when it comes to the role they are supposed to 
play in philosophical argumentation. But I doubt that anyone before 
Kripke would have called such hunches “intuitions.” 

But even many postnominalist philosophers had a theory or theo- 
ries to back up their appeals to intuitions. The best-known form of 
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such theories is the doctrine of innate ideas of the rationalists. An 
enterprising historian of ideas might try to trace this line of defense 
of intuitive knowledge to Plato’s anamnesis. Be that as it may, it was 
the doctrine of innate ideas that provided the rationale for appeals 
to intuition (whether under this title or not) by rationalists like René 
Descartes. 

When both Aristotelian forms and innate ideas became discred- 
ited, no respectable justification for intuition as a separate source of 
insights remained. Accordingly, by the eighteenth century the no- 
tion of “intuitive knowledge” became watered down and ended up 
meaning little more than immediate knowledge. When John Locke 
cites as a paradigmatic example of intuitive truths that one plus two 
equals three, he is not postulating a separate mental faculty of intu- 
ition or even a separate source of knowledge. This minimal sense of 
“intuition” has played a much more important role in subsequent 
philosophical terminology than is generally realized. For instance, 
interpreters like Emmanuel Levinas® who see in Husserlian Anschau- 
ung a special kind of “theoretical act of consciousness that makes ob- 
jects present to us” might be well advised to consider the ancestry of 
Edmund Husserl’s notion. 

Part of this ancestry is Immanuel Kant’s theory of how intuitions, 
even in a minimal sense of immediate representations of particulars, 
can yield synthetic knowledge a priori. His explanation turns on the 
idea that in sense perception we impose a certain relational struc- 
ture on experience, namely, the structure of space and time. Hence 
we can recover the relations we have ourselves imposed on our expe- 
rience by reproducing them in imagination. What is remarkable 
here is that this Kantian justification of the use of intuitions in math- 
ematics does not presuppose any particular mental faculty of inte 
ition nor any sense of “intuition” stronger than Kant’s own definitory 
unmittelbare Vorstellung von dem Exnzeinen. 

Now, what conceivable theoretical rationale do contemporary 
philosophers’ appeals to intuitions have? The embarrassing answer 
is: none of the above. The vast majority of philosophical writers 
these days take the name ‘intuition’ in vain since they do not be- 
lieve in Platonic anamnesis, Aristotelian forms, Cartesian innate 
ideas, or Kantian transcendental deductions. Nor can they find aid 
and comfort in the other, minor source of allegedly intuitionist 
methodology in twentieth-century philosophy. G. E. Moore’s argu- 
mentation is often labeled “intuitionistic.” If so, the term ‘intuition’ 


* The Theory of Intuition tn Hussert’s Phenomenology, André Orianne, trans. 
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is used in the eighteenth-century minimal sense in which the phrase 
‘intuitive knowledge’ is merely an euphemism for ‘immediate 
knowledge’. Moore’s method is predicated on his belief that in any 
experience we can distinguish the immediately given object of expe- 
rience from the experience itself as a subjective happening that is 
merely part of one’s consciousness. His appeals to “intuition” are 
therefore in reality appeals to what is objectively given to me in im- 
mediate experience. Not only is such a notion of “intuitively given” 
independent of any assumption of a special mental faculty of intu- 
ition, it presupposes that there is no such special faculty. 
Mooresque intuitively given objects are given in whatever experi- 
ences it is that we normally can have, not in special experiences 
called intuitions. 

But few contemporary philosophers are likely to swallow Moore’s 
“Refutation of Idealism”’ hook, line, and sinker so as to have the 
same rationale for their appeals to intuition. Nor are they likely to 
claim that they have solved the problems to which Moore’s method- 
ology led him and his friends when Bertrand Russell and others 
raised the question as to what the immediately known objects of ex- 
perience are, for instance, in our sense experience. Sense data? But 
what precisely are sense data? Surfaces of physical objects? States of 
one’s central nervous system? The search for such immediately given 
objects in one’s experience ended up being truly the longest philo- 
sophical journey of the Bloomsbury philosophers. 

Thus, contemporary thinkers’ practice of appealing to intuitions 
in philosophical argumentation is without any justification whatso- 
ever—in most cases. One of the interesting exceptions is the main 
character of the story of twentieth-century intuitions, Chomsky. Un- 
like his contemporaries, Chomsky could have a good intellectual 
conscience in appealing to grammatical intuitions, for he is a self 
acknowledged Cartesian. He believes in innate ideas, at least in the 
form of an innate universal grammar. Thus, apart from details, 
Chomsky had, and presumably has, up his sleeve the same justifica- 
tion for appeals to intuitions as someone like Descartes. 

Even though the matter needs closer analysis, it seems that Chom- 
sky’s emphasis on intuitions grew in tandem with his keener aware- 
ness of the Cartesian character of his own linguistic thinking. Later 
on, an opposite change seems to have taken place in Chomsky’s 
views in that he has come to put less stress on innateness and univer- 
sal grammar and, accordingly, less stress on intuitions. 


* Mend, N.S. xu (1908): 433-53, reprinted In Moore, Philosophical Studies (Lon- 
don. Routledge, 1922), pp. 1-30. 
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Of course, Chomsky’s Cartesianism was originally unknown to 
the majority of philosophers. He came out of the closet only in 
his Cartesian Linguistics.? And when he did, he used the rationalis- 
tic line of thought to defend his idea of deep structure rather 
than his intuitionist methodology. Hence it is perhaps not sur- 
prising that the rationale of Chomskian appeals to intuition has 
not been recognized by philosophers. Philosophers’ surprise and 
dismay at the discovery of Chomsky’s rationalism is in evidence in 
the historical Chomsky-Putnam-Goodman symposium on innate 
ideas.’ 

Even though Chomsky later enriched his views in a variety of ways 
and although he does not usually employ the term ‘intuition’, his 
position has remained basically the same. For instance, in Knowledge 
of Language,’ he still postulates “a particular faculty of the mind,” 
the “language faculty.” Moreover, speaker’s intuitions, now re-named 
“the judgments of native speakers,” provide access to that faculty: 
“To be sure, the judgments of native speakers will always provide reb 
evant evidence for the study of language” (1bid., p. 37). The main dif 
ference is that there are “many different sources apart from 
judgments concerning the form and meaning of expressions” (ibid, 
pp. 36-37). But this does not really cast any doubts in Chomsky’s 
mind on competent speakers’ intuitions (“judgments”) about lan- 
guage. The only qualification Chomsky registers is that such judg- 
ments may be disturbed by other factors: “judgments of 
acceptability, for instance, may fail to produce direct evidence as to 
grammatical status because of the intrusion of numerous other fac- 
tors” (ibid., p. 36). 

Whether this Cartesian justification of Chomsky’s relinnée on int 
itions is in the last analysis valid or not, it is not available to that great 
majority of analytic and nonanalytic philosophers who are not Carte- 
sians. Since they do not believe in Aristotelian forms or in Kantian 
transcendental expositions or deductions as state-of-the-art philo- 
sophical truths, their use of intuitions in philosophical argumenta- 
tion is totally without justification. 

Chomsky’s reliance on intuitions is a better tactic than philoso- 
phers’ parallel ploy for two other reasons. First, linguists’ intuitions 
pertain to the human language ability and its products. In contrast, 
philosophers’ intuitions do not pertain to the supposed faculty of in- 
tuition itself but to the truths about which this faculty is supposed to 
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provide knowledge. Hence Chomskian intuitions are not subject to 
the same epistemological problems as philosophers’. 

Second, Chomsky wisely restricts the intuitions mainly, albeit not 
exclusively, to one kind only, namely, to intuitions of grammaticality. 
Admittedly, it turns out, Chomsky notwithstanding, that not even all 
our intuitions of well-formedness have the kind of generative expla 
nation Chomsky prefers." But linguists are not normally exposed to 
misdiagnosed intuitions to nearly the same extent as philosophers 
are. 
The example of generative grammarians has been misleading in a 
more specific respect as well. Typical linguists’ intuitions have per- 
tained to the grammaticality of sentences. Linguists have not asked 
about people’s intuitions as to how that grammaticality might be 
tested. Philosophers’ intuitions have typically pertained to the truth 
or acceptability of various propositions. They have not usually in- 
quired into our intuitions as to how that truth or that acceptability is 
established. Yet an answer to the latter question is likely to be more 
easily forthcoming and more confident than an answer to the for- 
mer. I doubt that many philosophers can honestly claim to have 
sharp intuitions as to which one of the several so-called theories of 
truth is itself true, but everyone will agree that in order to ascertain 
that (dx) S[x], one should find an individual b such that 5[8]; that in 
order to ascertain that it is the case that (Vx) (dy) S[ x,y], one must be 
able to find, given any individual d, an individual b such that S[4d,d); 
and so on. 

Philosophers’ mistake in adopting the intuitionist interpretation 
of Chomsky’s methodology was compounded by their blind accep- 
tance of an oversimplified construal of another aspect of that 
methodology. That aspect is the notion of explanation relied on by 
transformational grammarians. Most analytic philosophers seem to 
think these days that the main role of their intuitions is to serve as 
raw material for a generalization. Conversely, a successful generaliza- 
tion—that is, one that captures all the different data provided by in- 
tuition—serves to explain those data. 

Once again, Chomsky is not a Chomskian, any more than Karl 
Marx was a Marxist. There is not a trace of the idea of generaliza- 
tion-as-explanation in his early methodological reflections. In his 
early thinking, paradigm cases of linguistic explanation included ac- 
counts of the structural ambiguity of a sentence (like the proverbial 
‘Visiting relatives can be boring’) by pointing that it can be gener- 


H See here my “On the Anythens and the Methodology of Linguistics,” Lingus- 
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ated in two different ways. In his later writings, he emphasizes such 
indicators as deductive depth as hallmarks of good explanations at 
the expense of subsumption under a generalization. 

It is nevertheless possible to argue that Chomsky’s own theories 
rely far too much on straightforward generalization from examples 
(particular cases). For instance, the successive versions of the govern- 
ment-and-binding theory seem to have arisen from their predeces- 
sors, not through any deeper analysis of the interplay of different 
factors affecting the conditions of coreference, but merely through 
adjustments in the relevant generalizations calculated to eliminate 
counterexamples that had meanwhile been discovered. I have ar- 
gued that such a tactic reduces greatly the theoretical appeal of gen- 
erative linguists’ theories.” 

This observation has even some direct methodological signifi- 
cance for linguistics. As I have also argued elsewhere, transforma- 
tional grammarians and other contemporary linguists would do a 
much better job if, instead of relying on our intuitions about isolated 
examples, they tried to vary systematically suitable ingredients in 
some sample sentence and observed how our “intuitions” change as 
a consequence. Now we can see why such systematic variation is a 
way of persuading our intuitions to yield general truths (dependence 
relations) rather than particular cases. 

More generally, reliance on generalization from particular cases is 
foreign to the methodological spirit of modern science, which origi 
nated by looking for dependences of different factors in instructive 
particular cases (often in controlled experimental situations), and by 
studying these dependences by the same mathematical means as a 
mathematician uses in studying the interdependences of different 
ingredients of geometrical figures in analytic geometry. 

In any case, philosophers’ reliance on the idea of explanation-by- 
generalization is rife with historical irony. The best-known philo- 
sophical explication of such an idea of explanation—Car]l G. 
Hempel’s “covering-law” model of explanation—was generally re- 
jected, at least verbally, around the same time as the intuitionist 
methodology became popular among them." It has been said that 
the covering-law model is dead, but in reality it is alive and popular 
among linguists and among philosophers who are wittingly or unwit- 
tingly imitating linguists. 


2 See here Gabnel Sandu and my On the Methodology of Linguistics. A Case Study 
(Cambndge: Blackwell, 1991), chapters 45. 

4 See here Ilpo Halonen and my “Toward a Theory of the Process of Explana- 
tion” (forthcoming). 
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A closer study of Chomsky’s argumentation should, in fact, have 
discouraged philosophers from generalizing his assumptions beyond 
linguistic intuitions. In his famous review of B. F. Skinner, Chomsky"* 
argues for the necessity of assuming a universal grammar by claiming 
that without such an assumption, we cannot explain children’s ca- 
pacity for language learning. The argument is that ordinary presup- 
positionless inductive learning is far too slow to account for the 
speed of children’s language acquisition. But by the same token, it 
should be necessary for a naturalistic epistemologist to assume 
strong innate ideas in order to account for the possibility of theory 
formation by rational scientists. For it is a striking feature of the dis- 
covery of many important scientific theories that they relied initially 
on rather few instances from which they were generalized. An induc- 
tive logic that would allow such steps as inductive generalizations 
presupposes either strong background assumptions or a high degree 
of a priori faith in the orderliness of nature. 

Some naturalistic epistemologists, most prominently W. V. Quine, 
have acknowledged the need (for their purposes) of some innate 
principles that could serve to give direction to inductive inferences. 
For instance, Quine speaks occasionally of innate quality spaces that 
govern our judgments of similarity. But such Quinean innate ideas 
are obviously far too few and far apart to provide a basis for a realis- 
tic theory of scientific reasoning. 

Another dubious ingredient in the Chomskian use of intuitions is 
the tacit assumption that our basic intuitions about the particular ex- 
amples to which they pertain are unconscious in the sense that we 
are not aware of the reasons why we accept or reject them. This fea- 
ture of Chomskian intuitions is not unrelated to their being re- 
stricted to particular cases. The reasons for which we accept and 
reject particular cases, consciously or unconsciously, would realisti- 
cally speaking have to include some general grammatical rules or 
principles. But if so, we could become aware of the general rules or 
principles of grammar directly, without going by way of induction 
from particular cases, which Chomsky does not allow. 

A closer analysis of the nature of scientific explanation confirms 
these suspicions of contemporary philosophers’ methodology. It 
turns out that, even though there are covering laws hidden in all 
why-explanations, the actual process of explanation does not consist 
in subsuming data under such a law. It consists in deriving the ex- 
planandum from an established background theory plus suitable ad 
hoc (or perhaps better ad explanandum) observations. A scientific 
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explanation without a pre-existing theory is an oxymoron.” Hempel 
does receive a qualified vindication, however. It turns out that under 
a wide range of circumstances, the success of an explanation guaran- 
tees the existence of a covering law. But to explain is not to find the 
covering law, and the covering law is not the explanation. 

But where do such theories come from? Here we encounter one 
of the most debilitating weaknesses of contemporary philosophers’ 
reliance on intuitions. For it is generally—though not universally— 
thought that intuition, like sense perception, always deals with par- 
ticular cases, however representative. Some such restriction is 
implicit in all analogies between intuition and sense perception. But 
if so, intuitions alone cannot yield the general truths: for instance, 
general theories for which a scientist and a philosopher is presum- 
ably searching. Some kind of generalizing process will be needed, be 
it inductive inference, abduction, or a lucky guess. 

This restriction of the data allegedly provided by intuition to par- 
ticular truths is an instance of the more general assumption that I 
have called the atomistic postulate. According to it, the basic input into 
our epistemic process consists of particular data, excluding general 
truths. I have argued that it is the mistake that underlies much of 
current epistemology and philosophy of science. It is by any judg- 
ment an especially debilitating aspect of recent philosophers’ intw 
itionistic methodology. Even if there are intuitions concerning 
particular cases, we are not any wiser because of them. 

A defender of the use of intuitions in philosophy asked me in dis- 
cussion: “What is wrong with starting from intuitively given obvious 
particular cases in philosophical argumentation?” I replied: “What 
would be an example of such obvious truths?” My interlocutor 
replied: “Surely, everybody agrees, say, that it is wrong to torture a 
child for pleasure. What is wrong with starting from such universally 
accepted special cases and using them as a basis for generalizations, 
in this case ethical ones?” 

In fact, I could not have imagined a better illustration of my own 
point. For what is the generalization implicit in this particular case? 
This question is tantamount to the question: Which parameters of 
the given proposition are the ones with respect to which we should 
generalize? And asking this obviously amounts to asking: What are 
the factors affecting our judgment about the situation envisaged in 
it? Is it sometimes all right to torture a child for reasons other than 
pleasure? If so, what reasons? Is it ever permitted to torture an adult? 


4 See foomotes 11 and 13 above. 
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If so, what is the critical age? Is it ever acceptable to torture an adult 
for reasons other than pleasure? If so, what reasons? Obviously, my 
interlocutor’s “Intuition” does not throw any real light on one’s ethi 
cal principles. 

In order to be useful, intuitions must have some kind of at least 
implicit generality. For instance, assume that I judge a gift not to be 
ethically meritorious because it was given merely to satisfy the recipi 
ent’s expectations. In such a judgment, there is an implicit ethical 
principle involved in that the judgment presupposes that a necessary 
condition for an act to be meritorious is that it be spontaneous. 

The idea that intuitions pertain to particulars, not to general 
truths, is deeply entrenched in the history of the idea. It is implied in 
the traditional consciousness by the analogy between intuition and 
sense perception; for the majority of philosophers used to follow 
Aristotie’® in thinking it is sense perception that deals with particu- 
lars. Accordingly, such “intuitive” methods as the use of ekthesis in 
logic were viewed with suspicion. It is interesting to see that thinkers 
as late as Gottlob Frege rejected all appeals to intuition in logic be- 
cause intuitions are about particulars, not about the general truths 
logic deals with. 

This misleading idea that intuition deals with particular cases was 
strongly encouraged by what was perceived as Chomsky’s methodol 
ogy. For the intuitions he recommended linguists to start from were 
intuitions concerning the grammaticality of particular strings of sym- 
bols, not concerning general rules of grammar. 

More generally, what is wrong with twentieth-century philoso- 
phers’ use of their alleged intuitions is not so much that they are 
wrong as that they are limited in their applicability and that their 
presuppositions are not recognized and spelled out. In this respect, 
Kripke’s historically influential use of his presumed intuitions offers 
a telling example. As usual, the problem is not that there is nothing 
to a philosopher’s intuitions. But intuitions in the sense Kripke talks 
about them come cheaply—at least as cheaply as the endoxa from 
which Aristotle typically starts his argumentation. Indeed, Aristotle’s 
use of endoxa offers an interesting object of comparison for contem- 
porary philosophers’ handling of their “intuitions.” Aristotle realized 
what Kripke and most others like him have never realized, namely, 
that endoxa or prima facie intuitions do not come fully equipped - 
with instructions for their use. They are not premises for philosophi 
cal arguments; they are raw material to be critically weighed, cor- 
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rected, and integrated into a coherent view. Their presuppositions 
have to be uncovered and their tacit limitations recognized before 
such integration is possible. This is what most of Aristotle’s own 
philosophical argumentation amounts to. 

Such a critical scrutiny of our prima facie intuitions typically 
amounts to re-educating our endoxa so as to do better justice to the 
realities of the conceptual situation. One of the most crucial failures 
of contemporary philosophers has been that they have not realized 
the need of re-educating and re-regimenting even some of their 
most sacred intuitions. In this respect, an instructive and clear-cut 
test case is offered by the intuitions that have prompted the formula- 
tion of the most basic laws of our basic logic, the logic of quantifiers. 
A number of such intuitions were regimented by Frege and Russell 
into the received logic of quantifiers. It has taken more than one 
hundred years to realize that that regimentation was too narrow in 
that it excluded certain perfectly possible and intuitively inter- 
pretable structures of quantifiers.’ The resistance that this simple 
observation has encountered 1s unmistakably and undoubtedly due 
to the fact that it involves a partial re-education of our “intuitions” 
about quantifiers. In particular, it involves the realization that the 
full modeltheoretical meaning of quantifiers is not exhausted by the 
“intuitive” idea of their “ranging over” a class of values. 

In a different direction, many philosophers still have not acknowl- 
edged that the intuitions on which some of the familiar rules of in- 
ference are based are not operative in modal and intensional 
contexts. Logicians’ intuitions seemed to tell them, for instance, that 
existential generalization is a valid mode of inference, that is, that 
from a sentence of the form S[b] you can always infer (Ax) S[x]. But 
this ‘always’ is mistaken. If you confront a philosopher with a specific 
example, he will readily confess to different intuitions. For instance, 
the intuitions on which existential generalization is based could 
never support an inference from: 


(1) Ari knows that Homer was Homer. 
to: 

(2) There is an indvidual such that Ari knows that he was Homer. 
which obviously says the same as: 

(2)* Ari knows who Homer was 


" For a fuller explanation, see Sandu and my “A Revolution in Logic?” Nordic 
Journal of Philosophical Logic, 1 (1996): 169-83. 
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It is equally clear “intuitively” that an inference from (1) to (2)* is 
valid only if ‘Homer’ picks out the same individual in all the relevant 
possible scenarios—in other words, only if Ari already knows who 
Homer is. 

In general, existential generalization with respect to a singular 
term presupposes that that term refers to the same individual in all 
the relevant worlds. This is acknowledged, in effect, by the likes of 
Kripke, who maintains that existential generalization with respect to 
a proper name is always valid. This he can do because his alleged in- 
tuitions include the idea that proper names refer to their objects 
necessarily, that is, in every possible world. This is what he refers to 
as rigid designation. In other words, it is not possible that they 
should refer to one object in one world and to another in another 
world. In some sense, this intuition can be granted to Kripke. But he 
never recognizes its tacit limitations, one of which is that it can only 
be taken to refer to logically or perhaps metaphysically possible 
worlds. It is the easiest thing in the world to imagine epistemically 
possible worlds in which a proper name refers to different objects. 
Thus, to the extent Kripke’s basic intuition is applicable to our ac- 
tual discourse, it excludes from the scope of his theory the most im- 
portant applications of intensional logics, namely, their application 
to our epistemic concepts. 

What is even more important, Kripke’s intuition involves a colossal 
presupposition. He is assuming that it makes unproblematic sense to 
speak of the identity of two individuals residing in two different pos- 
sible worlds. He fails to see that there is nothing in our notion of ref- 
erence, either intuitively or explicated along the natural 
possible-worlds lines, that prejudges such questions of cross identity. 
For the purpose of the reference of, say, a singular term, it suffices to 
determine the individual (if any) for which it stands in each logically 
possible world. But such a determination is possible without presup- 
posing anything about which individual is which in another possible 
scenario. And a modicum of analysis of how our actual discourse op- 
erates quickly shows that our principles of cross identification are in- 
deed largely independent of our criteria of reference. We constantly 
use two different systems of cross identification, which makes nom- 
sense of any unqualified idea of rigid designation. 

Kripke’s inadequate excuse for not doing the critical analytic work 
that the explication of his own intuitions requires is that the “possi 
ble worlds” with which he is dealing are our constructions out of the 
individuals that there are in the actual world. But this restricts the 
applicability of Kripke’s intuitions so much as to make them uninter- 


THE EMPEROR'S NEW INTUITIONS 141 


esting. In any half-way realistic application, the different scenarios 
contain different entities, most of which have no counterparts in the 
actual world. And these scenarios are not constructed by us in any 
reasonable (or intuitive) sense of the word. They are typically the ob- 
jective possibilities we have to heed and from which some nonhu- 
man agent—nature, chance, god—chooses one to be realized. When 
the National Hurricane Center is preparing for a new season, it as- 
signs proper names to future hurricanes in the order of their ap- 
pearance, but there is no presumption that in the different possible 
scenarios for this season, the equally named hurricanes are in some 
metaphysical sense the same storm. In 1992, it might very well have 
happened that the first hurricane of the season never touched the 
mainland of the United States and instead petered out in the At- 
lantic. I doubt that the good citizens of the Miami area would say 
that such a possible hurricane was the same as the notorious hurri- 
cane Andrew that was the most expensive storm in United States his- 
tory. Yet it would have had the very same properly conferred name. 
If Kripke’s theory were literally true, the National Weather Service 
could be accused of grossly abusing the English language by naming 
hurricanes the way they in fact do. And, what is most important here, 
if that possible namesake of hurricane Andrew had actually been re- 
alized, it would not have been constructed by Kripke as a part of his 
semantical theory or by any other agent, except (anthropomorphi 
cally speaking) by nature. 

The upshot is that Kripke’s intuitions about what possible scenarios 
(“worlds”) are thus are even more limited in scope than his intuitions 
about rigid designation. Obviously, in his attempt to visualize his own 
“relations of ideas” about possible scenarios, Kripke has had in mind 
only the kinds of artificially simple combinatorial possibilities that 
probability theorists use in explaining to students their notion of sam- 
ple space, such as different outcomes of a series of tosses of a coin or 
a dice or of outcomes of draws of balls from an urn. Unlike Kripke, 

_ the probability theorists nevertheless realize fully that most of the 
‘? reaHife applications of their concepts are to sample spaces messier by 
orders of magnitude than such artificial Las Vegas applications. How 
unrealistic Kripke is can be shown by the fact that even the simplicity 
of artificial gambling examples is arguably only skin deep. I have illus- 
trated this point in another (unpublished) paper by reference to the 
sample space of possible outcomes of a toss of dice: 


[But], what if the high roller in question is King Olaf and his dice 
splits into two, what do you do? Do you consider both halves of the bro- 
ken dice as legitimate continuations of the original dice and add up 
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what they show, or is only one of them identical with the original one? 

If so which one? Or is perhaps neither one identical with the old dice?” 

In the first case, an unexpected identificatory situation creates the split- 

ting of sample-space points into several. In the last one, several sample- 

space points merge with each other. Even small-scale examples [of 
sample spaces] are thus, in principle, not immune to questions of ider 
tification. 

What Kripke should have done is to try to work his intuitions pa- 
tiently and critically into an explicit framework of expressing the in- 
tuitions and whatever else he wanted to express. This would have 
had to include expressing the methods of identification which 
Kripke presupposed but did not acknowledge. It seems that he 
might have been prevented from recognizing them by his uncritical 
adherence to other intuitions. Those intuitions are the ones on 
which the received Frege-Russell logic of quantifiers are based and 
which were seen to be unduly restrictive. For what is needed for the 
purpose of expressing in an obvious way the rigidity of reference by 
a singular term bin a context governed by the necessity operator N? 
The obvious way is: 


(3) (dx) N(b = x) 


For what (3) says is that there is an individual—one and the same in- 
dividual—that is referred to by bin all possible worlds. But according 
to the holy rules of the received first-order logic, canonized by the 
intuitions of most post-Fregean logicians, (1) is entailed by: 


(4) Nb=3) 


which merely says that 6 is self-identical in each such possible world. 
Hence an attempted inference from (4) to (3) springs from a confu- 
sion between ‘each’ and ‘all’. There is nothing in (4) to guarantee 
that bis the same individual in different possible worlds. More gener 
ally, many philosophers’ belief in the universal validity of existential 
generalization is due to the fact that in “intuiting” this validity, they 
had before their intuition’s eyes a situation involving only what can 
be said of the actual world (or in any case some one and only possi- 
ble world). Such intuitions easily become seriously misleading when 
we are explicitly or tacitly considering several possible scenarios 
(“worlds”). This nonsensical result makes (3) useless as an expres- 
sion of rigidity in all logics that assume the validity of existential 


» Cf. Ivar Ekelund, The Broken Dice and Other Mathematical Tales of Chance 
(Chicago: University Press, 1993), p. 3. 
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generalization. Once again, what is needed is a model-theoretical 
re-education of our intuitions. If (3) expresses what it obviously is 
calculated to express, namely, that ġ picks one and the same object 
in all the possible worlds brought into play by the necessity operator 
N, it is not implied by (4). Hence the first thing philosophers dis- 
cussing identification and reference should do is to reformulate the 
logic of quantification as it is used in modal and intensional contexts 
when it comes to existential generalization. This amounts to the 
recognition of the limited scope of the intuitions which originally 
prompted the received Frege-Russell logic of quantification, promi- 
nently including the rule of existential generalization. 

What has been said so far might give the reader the impression 
that J am against all uses of intuitions in philosophical and scientific 
reasoning. Such an impression would be wrong. What I am against 
are the reifications and mystifications surrounding both philoso- 
phers’ and laymen’s ideas about intuitions. These intuitions can be 
divided into two classes. They are the alleged intuitions concerning 
empirical truths and those concerning conceptual (including lin 
guistic) ones. (David Hume might have called them intuitions about 
matters of fact and intuitions about relations of ideas.) 

As far as empirical realities are concerned, the brute fact is that 
the intimations of intuition do not have any privileged epistemologi 
cal status. They do not carry any automatic justification with them, 
no matter how convincing they may be subjectively. Epistemologi- 
cally, they are on the level of clever guesses or perhaps of Aristotle’s 
endoxa, We are all familiar (I hope) with insights that are instantly 
persuasive, but we are also familiar with equal persuasive insights 
that subsequently turn out to be mistaken. 

Furthermore, there is no evidence that such insights, even when 
valid, are products of any particular mental faculty. Elsewhere, I have 
shown that what are explicitly called intuitions often turn out to be 
products of perfectly ordinary discursive thinking, combined with 
suitable observations. Even when one can locate a characteristic fea 
ture of what are called intuitive insights, that special character does 
not presuppose a separate faculty or capacity. I believe that, espe- 
cially in mathematical and logical contexts, appeals to intuitions typi 
cally amount to uses of modeltheoretical insights or concepts. For 
instance, why is it more “intuitive” to carry out a geometrical proof 
in terms of figures instead of (say) first-order logic? This question is 
pertinent because it is perfectly possible to conceive of the use of fig- 
ures as being nothing but an alternative symbolism parallel with the 
first-order logical symbolism and intertranslatable with it. A proof in 
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either symbolism can be equally stringent and in fact analogous, so 
that even in the purely logical proof one can speak, for example, of 
the introduction of new individuals into the argument in analogy 
with the “auxiliary constructions” of traditional geometrical proofs. 
The answer to the question is that the use of figures brings out much 
more directly the modeHheoretical meaning of a geometrical proof. 

It is characteristic of the misconceptions that I am criticizing that 
this “secret” of the intuitiveness of proofs by means of figures has 
been misinterpreted. It is generally thought that figures served the 
purpose of facilitating appeals to our geometrical intuition for the 
purpose of introducing assumptions not sanctioned by the axioms 
and postulates of geometry. Such appeals are admittedly found in 
Euclid, but they are as much violations of the rules of figurative argu- 
mentation as the corresponding fallacious steps would be violations 
of the rules of logical proof. Typically, the assumptions so intro- 
duced were manifestations of the continuity properties of geometri- 
cal objects. Such properties were for a long time difficult for 
mathematicians and logicians to formulate in any notation. 

From these observations it follows that it is nonsense to think that 
“intuitive” insights must be unconscious. Sometimes they are, but even 
then it might be possible for the reasoner in question to recognize the 
discursive steps of reasoning which originally surfaced in the form of a 
single “intuition.” In others, special techniques might be required, 
ranging from leading questions to truth sera and hypnosis. In others, it 
is clear that an “intuitive” judgment is the internalized product of train- 
ing that utilizes perfectly discursive means in teaching the reasoner to 
notice subtle observational clues. One can enhance the intuitiveness of 
one’s own reasoning quite consciously, as in the use of figures and dia- 
grams. All told, the unavoidably unconscious nature of intuitions is 
nothing but a pernicious myth we should quickly get rid of. 

It may even be the case that we humans are hardwired to carry out 
certain types of reasoning more “intuitively” than others, probably 
for evolutionary reasons. For instance, we apparently have fairly 
sharp intuitions concerning the relationships involved in Euclidean 
geometry—sharp enough to have led Kant to think that they signal a 
special status of Euclidean geometry. Such intuitiveness can be ex- 
ploited for analytic and systematic purposes. For instance, it was a 
brilliant idea of David Hilbert’s and John von Neumann’s to think 
systematically of the relationships among certain functions in anal- 
ogy with the interrelations of geometrical entities like vectors. But 
such intuitiveness does not confer any privileged epistemological sta- 
tus on the resulting geometrized theories. 
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Underlying several of the points just made is an important distinc- 
tion between heuristics and justification. In a specifiable objective 
sense, it is easier for human beings to discover and to formulate geo- 
metrical proof, for example, by using such “intuitive” techniques as 
figures. But that does not make such proofs any more or any less 
stringent than logical proofs conducted in the pedestrian notation 
of formal logic. 

Thus, when it comes to empirical truths and empirical theories in- 
dependent of my own concepts and my own language, it might very 
well be wisest to forget the ill-conceived concept of intuition alto- 
gether. Of course, this does not deny the reality of the phenomena 
that people are trying to fit under this term. 

But what about my knowledge of my own language and my own 
concepts? Do I not have, for example, a privileged access to what I 
mean by my words, of the kind someone might label intuition? This 
would not be anything stranger than the intuitions of grammaticality 
on which transformational grammarians at one time used to rely. I 
have no objection to granting the existence of such access to my own 
concepts and meanings. What I challenge, however, is the privileged 
epistemological character of any knowledge obtained through this 
allegedly privileged access. 

How do I find out about another person’s language or about her 
concepts? For instance, what does she mean by a certain word? The 
first approximation to an answer is: Why not see how that person 
uses the word? Or perhaps you can simply ask her. If you want to 
find out how your informant uses color words, put her in a room 
with differently colored objects and observe which color word she 
uses for each. Or you can use a color chart and ask the informant 
which word is applicable to each of them. 

This simple (and in some sense simple-minded answer) in fact car- 
ries us much further than most philosophers seem to be willing to 
admit. It may be the case that neither method gives us an indu- 
bitable particular case from which to generalize, but I believe that we 
must adopt a fallibilist epistemology anyway. Quine and some others 
have argued that we cannot exhaustively determine the meaning of a 
word in this way. This argument is in the last instance predicated on 
a rejection of the idea of possible scenario (“possible world”), as is 
shown in my “Three Dogmas of Quine’s Empiricism.”"* If that notion 
is allowed, then there is, in principle, no definite limit as to how far 
your experimentation (construction of ever new situations) can carry 
you in determining the class of scenarios in which the word does or 


" Revue Internationale de Philosophie, LI (1997): 457-77. 
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does not apply. And such a determination will, at least for a Mon- 
tagovian semanticist, determine the meaning of the word. Indeed, in 
Montague semantics, the meaning of a term is the function that 
maps possible worlds on references (extensions) of the appropriate 
logical type (category). And such functions can, in principle, be 
identified even more and more fully by systematic experimentation 
with the references that a person assigns to his terms in different ac- 
tual or imagined scenarios. 

The sketchiness of this account is blatant. But this approximate 
character of my account does not matter here, for I am not trying to 
give a full account of meaning determination but to compare third- 
person (or second-person) and first-person cases. And if so, I can 
simply say that what is semantics for a goose is also for a gander. In 
other words, the third-person and the first-person cases are, in prin- 
ciple, on a par. I can put myself in different situations and find out 
how I apply my words in them. Or I can put a question to myself. 
(There is nothing in the logic of questions and answers that prevents 
my doing so.) The only difference is that in the first-person cases (of 
either kind) I can typically skip the stage setting. Instead of putting 
myself physically in a certain kind of situation and seeing what I shall 
do in it, I can imagine myself as being in such a situation. Normally, 
I can confidently say how I would use my words and my concepts in 
them. Of course, I cannot always do so, as J. L. Austin’s™ bullfinch 
reminds us, and sometimes I can be wrong. But this does not affect 
my Main point. It is simply an aspect of this analogy between first- 
person and third-person cases, in that there can be fuzziness in our 
concepts in either case. 

I do not see any reason to deny that I can by means of such a 
thought experiment obtain objective knowledge about my own lan- 
guage and my own concepts, just as I can do so in the case of others 
by real experimentation. J do not need a concrete stage setting for 
the purpose. Thought experiments serve as well in the first-person 
case as real experiments in the third-person cases. If someone now 
wants to label the answers obtained by means of such thought exper- 
imental “intuitions,” I do not have any objections, as long as it is real- 
ized what is involved. Such intuitions are not unlike Aristotelian 
ones, obtained by realizing the appropriate forms (concepts) in my 
soul. Unlike Aristotelian intuitions, however, the results of my 
thought experiments need not be infallible. Unlike Aristotle, I am 
also maintaining that such intuitions are informative only about our 


= Philosophical Papers (New York: Oxford, 1961), p. 56. 
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own language and about our concepts. Hence there is room in our 
epistemological arsenal, I believe, for intuitions with a respectable 
historical pedigree. 

Unfortunately, the vast majority of appeals to intuition by contem- 
porary philosophers cannot be conceived as controlled thought ex- 
periments nor be justified by recasting them as such. In view of such 
goings-on, I am tempted to suggest, half-jokingly—but only hal§ok- 
ingly—that the editors of philosophy journals agree to a moratorium 
on all papers in which intuitions are appealed to, unless the basis of 
those appeals is made explicit. 

JAAKKO HINTIKKA 
Boston University 
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COMMENTS AND CRITICISM 


SUPERVALUATION CAN LEAVE TRUTH-VALUE 
GAPS AFTER ALL* 


mong other good things, supervaluation is supposed to allow 
At sentences to go without truth values.’ But Jerry Fodor 

and Ernest LePore? have recently argued that it cannot allow 
this-—not if it also respects certain conceptual truths. The main 
point I wish to make here is that they are mistaken. Supervaluation 
can leave truth-value gaps while respecting the conceptual truths 
they have in mind. 

Fodor and LePore are not the only ones to doubt that supervalua- 
tion really can do all that it has been held out to do. Supervalua- 
tional treatments of vagueness have been said to preserve classical 
logic, to solve the sorites paradoxes, to account for higher-order 
vagueness, and more besides. All of this has been called into ques- 
tion.’ But Fodor and LePore claim to add something new to the case 
against supervaluational treatments—they claim to have cast doubt 
on their very inteliguinkty. It will emerge that they have done nothing 
of the sort. 


* I would hke to thank Steve Kuhn, Alice Kyburg, Ernest LePore, Jeff Pelleter, 
Achille Varzi, Timothy Williamson, and members of the Logic and Language 
Group of Georgetown University and the University of Maryland for helpful com 
ments, 

! Supervaluational treatments of vagueness go back to a suggestion of H, 
M in The Reach of Saence (Toronto: University Press, 1956). Related ideas can 
be found in M. Przelech, "Z Semantyki Pojec Otwartych,” Studia Logica, xv (1964): 
189-220 (translated as “The Semantics of Open Concepts,” in J. Pelc, ed., Semiotics 
m Poland, 1896-1969 (Dordrecht: Reidel, 1979), pp. nyc Michael Dummett, 
“Wang’s Paradox,” Synthese, xxx (1975): 301-24; David Lewis, “General Semantics,” 
in Donald Davidson and Gilbert Harman, eds., Semaniscs of Natural Language 
(Boston: Reidel, 1972, 2nd ed.), pp. 169-218; and Hans Kamp, “Two Theones 
about Adjectives,” ın E.L. Keenan, ed., Formal Semantics of Natural Language (New 
York: Cambridge, 1975), pp. 128-55. The supervaluational approach uw taken fur- 
thest by Kit Fine, in “Vagueness, Truth and Logic,” Synthese, xxx (1975): 265-300. 

*“What Cannot be Evaluated Cannot be Evaluated and It Cannot Be Superval 
ued Either,” this JOURNAL, xvu, 10 (October 1996): 516-35. 

> Timothy Willameon argues that ions do not preserve classical logic— 
Vagueness (New York: Routiedge, 1994). Kamp criticizes the supervaluational solu- 
tion to the sorites paradoxes-—“The Paradox of the Heap,” in U. Monnich, ed., 
Aspects of Philosophical Logic (Boston: Reidel, 1981), pp. 157-74; so does R.M. Sains 
bury—Paradoxas (New York: Cambridge, 1988). Colicims af the supervaluational 
treatment of higher-order vagueness can be found in Willtamson; and in Sainsbury, 

without Boundaries,” in R. Keefe and P. Smith, eds., Vagueness: A Reader 
(Cambridge: MIT, 1996), pp. 251-64. 
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I shall proceed as follows. In section 1, I set out supervaluational 
interpretations of vague languages; in section O, | state Fodor and 
LePore’s objection to these; finally, in section m, I argue that this ob- 
jection is groundless. Not only can supervaluation leave truth-value 
gaps while respecting Fodor and LePore’s conceptual truths; we 


shall see that it must leave them, if it is to respect these truths. 
I SUPERVALUATIONAL TREATMENTS OF VAGUENESS 


To begin with, I shall expound the supervaluation of vague sen- 
tences. I shall follow Kit Fine’s treatment (op. at.). This discussion is 
a supplement to Fodor and LePore’s, which oversimplifies at one 
crucial point that I shall get to shortly. 

Underlying supervaluational treatments is the idea that vagueness 
is a variety of semantic incompleteness. Take the vague adjective 
‘bald’. It is semantically incomplete insofar as it can happen that, be- 
sides those individuals which are determinately bald and those which 
are determinately not bald, some borderline cases are neither. Gou- 
pling a vague adjective with the name of a borderline case results in 
a sentence—say, ‘Al is bald’-—that is in turn semantically incomplete: 
it is neither true nor false; it has a fruth-value gap. 

Supervaluational treatments depend furthermore on the idea that 
vagueness can be reduced by resolving indeterminacy. With a vague 
adjective that happens when, leaving determinate cases untouched, 
borderline cases are resolved one way or the other. For example, 
‘bald’ can be made more precise in various ways, accordingly as bor- 
derline cases are included among the determinately bald or among 
the determinately not-bald. The different ways of making vague ex- 
pressions more precise are precisifications. With a view to making 
things concrete, we can regard them as assignments of suitable ex- 
tensions to vague expressions; so each precisification assigns to 
‘bald’ a positive extension of individuals that are determinately bald, 
and a negative extension of individuals that are determinately not- 
bald. One precisification p* is more precise than another, p, if the posi- 
tive extensions that p* assigns to expressions are more inclusive than 
the positive extensions that p assigns to them, while the negative ex- 
tensions that p* assigns are also more inclusive than the correspond- 
ing negative extensions that p assigns. 

The intuitively understood meanings of vague expressions limit 
the ways of making them more precise. Internal constraints control 
the boundaries of single predicates: if on making ‘bald’ more precise 
we count one man in, then we must also count in every man with 
even less hair. Other eadernal constraints control the common bound- 
aries of several predicates: making ‘red’ and ‘pink’ more precise 
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shall not lead us to count the same blob as both. Fine calls these two 
kinds of constraints penumbral connections. Precisifications within the 
limits of penumbral connections are admissible. 

Fine models precisifications after the disambiguations of ambigu- 
ous expressions. He seems to think that vagueness and ambiguity are 
closely related: 


[T]he only formal difference is that the precisifications may be infinite, 
even indefinite, and may be subject to penumbral connection. Vague- 
ness is ambiguity on a vast and systematic scale (op. at, p. 282). 


The idea that vagueness is on a par with ambiguity is surely contro- 
versial, though. For one thing, whereas disambiguations are contex- 
tual, insofar as we typically assert just one disambiguation of an 
ambiguous sentence, precisifications are not contextual in quite this 
way. We do not pick just one complete precisification of a vague sen- 
tence and assert that. 

Even so, precisifications might be somewhat contextual. Consider 
the possibilities for reference with vague definite descriptions. Sup- 
pose Al is at a bar being salient along with Mick Jagger, David Bowie, 
and others who, unlike Al, still have a lot of hair. Then someone 
spots him and points him out: “Look, there’s All He’s the bald one.” 
Now, Al is not really bald. He is borderline. Even so, this utterance of 
‘the bald one’ denotes him. This can be accounted for if we suppose 
that, although we never pick one complete precisification, we some- 
times do pick a more or less limited range of them. We can suppose 
that this utterance leads us to pick those ways of drawing a precise 
boundary which include, among the bald, a single salient individual; 
for, in keeping with the relevant penumbral connection, each such 
precisification will single out Al. Our picking these precisifications 
we can regard as an accommodation, furnished to enable the deft 
nite description to denote. 

Precisifications, if somewhat contextual, are in this regard at least 
somewhat like disambiguations. Be this as it may, there is no need to 
tie vagueness to ambiguity here, and some advantage to keeping 
them separate. We can simply think of the precisifications of a vague 
expression as all of the different ways of drawing precise boundaries 
that resolve borderline cases. The question whether vagueness and 
ambiguity are related can be left to one side. 

So much for precisifications. I shall now turn to the evaluation of 
vague sentences. On a supervaluational treatment, the truth of, say, 
‘Al is bald’ depends on two things. One is Al’s hair. The other is the 
precisification of ‘bald’. Let us put aside questions of temporality 
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and modality, so that we need not consider the hair Al once had but 
no longer has, or hair he might have had but actually does not. 
Then we can think of the nonlinguistic matters on which truth de- 
pends, such as Al’s head of hair, as fixed. Truth will depend only on 
the different precisifications of vague expressions. 

Supervaluation amounts, as Fine puts it, to “a sort of principle of 
non-pedantry: truth is secured if it does not turn upon what one 
means” (op. at, p. 278). The idea is that a vague sentence shall be 
counted true if it is true on each way of making it more precise. On 
the simplest view, the only precisifications that matter are the com 
plete ones, which leave nothing at all indeterminate. It is there that 
evaluation begins. Evaluation at a complete precisification is classi- 
cal. Each individual falls either within the positive extension of any 
given predicate, or within its negative extension, so each atomic sen- 
tence can be assigned a determinate truth value, true or false. Sen- 
tences compounded with the truth-functional connectives ‘and’, 
‘not’, and so on follow suit, according to the classical truth defini 
tions. In this way, each complete precisification determines a classi- 
cal valuation. 

Moving now from complete precisifications to those which retain 
some indeterminacy, truth goes by the principle of nonpedantry. A 
sentence is counted true if it is true on each way of making it com- 
pletely precise: 


condition, Sis true (/false) on precisification p just in case 
for every admussible complete p* that is more precise than p, Sis true 
(/false) on p*. 


A sentence will go without a truth value on pif it is true on some ad- 
missible complete precisifications of p, but false on others. So an in- 
complete precisification determines a partial valuation, on which 
some sentences are true, others are false, and yet others are neither. 
We evaluate sentences relative to a preferred precisification that is 
appropriate, insofar as it retains a measure of indeterminacy that re- 
flects the intuitively understood meanings of vague expressions. 
Fodor and LePore call such an appropriate precisification a base 
model, or simply a model (518-19). But the super-truth condition calls 
on other, complete precisifications. An appropriate precisification 
and these others together make up an interpretation, or what Fine 
calls a specification space. An interpretation is a tree with the appropri 
ate precisification at the base of the trunk, with increasingly precise 
ones as we move upward through the branches, and with complete 
precisifications at the leaves. Truth, falsity, and indeterminacy go by 
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the appropriate precisification: a sentence is true in an interpreta 
tion if true on each of its complete precisifications; false if false on 
each of them; otherwise, the sentence is indeterminate. 

It is now possible to say at which point Fodor and LePore oversim- 
plify. They interpret vague language in models that are, in effect, iso- 
lated precisifications. My interpretations and Fine’s specification spaces, 
on the other hand, include a range of different precisifications. In sec- 
tion m, it will become clear that Fodor and LePore’s treatment does not 
lend itself to a proper understanding of the expression ‘there is no mat- 
ter of fact about whether’. Meanwhile, in the following section, we shall 


see that this matter lies right at the heart of their objection. 
Il. FODOR AND LEPORE’S OBJECTION 


It would appear that supervaluation will leave ‘Al is bald’ without a 
truth value, if Al is borderline bald. Not so, say Fodor and LePore; 
not if supervaluation honors the principle: 


(P) Conceptual truths must be respected by all classical models, includ- 
ing classical valuations (521). 


Fodor and LePore argue that there are conceptual truths which (P) 
demands classical valuations respect but which, if respected, rule out 
complete precisifications. This will in turn rule out truth-value gaps, 
since for a sentence to go without a truth value there have to be 
complete precisifications on which it is true, and others on which it 
is false. 

To give an example, we must first decide just what kind of thing 
Fodor and LePore take conceptual truths to be: perhaps sentences, 
or the contents of sentences, or something else again? They do not 
say.* Plausibly conceptual truths are sentences;* in this case, the fol- 
lowing is a representative example: 


(1) There is no fact of the matter about whether a man one ninth of 
whose head is covered with hair is bald (522), 


Nothing depends on the fraction one ninth, of course. You can sub- 
stitute any other you like, if you think a man with that much hair isa 
borderline case. Fodor and LePore argue that we have here not 
merely a truth, but indeed a conceptual truth. For the sake of the ar- 
gument, let us allow that it is so. 


‘What Fodor and LePore do say does not settle this: “[W]e use ‘conceptual 
truths’ in (P) as a dummy-térm for contributions that nonlogical vocabulary 
makes to meaning” (521). Sentences, maybe; their contents, maybe. But contridw- 
hons? 

* What, otherwise, are we to make of (P)? It is to sentences that valuations assign 
truth values, not to their contents, or to anything else. 
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Fodor and LePore’s conceptual truths must be respected by all 
classical valuations, according to (P). So they must hold true on all 
complete precisifications (since these determine classical valua- 
tions). That is, what (P) demands is that these truths shall enjoy the 
favored status of Fine’s penumbral truths, whose truth on all complete 
precisifications is secured by penumbral connections. An example is: 
‘If one man is bald, then so is every other man with even less hair’. 
Fodor and LePore do not offer a generally compelling reason to ac- 
cept this demand of (P). They do not really need to, though, since it 
is sufficient for their purpose that the supervaluakonisi must accept it 
and this, it seems, he must. Honoring (P) seems to be the motivation 
for introducing penumbral constraints in the first place, and if the 
supervaluationist is out to honor this principle at all, he ought to 
honor it fully. He cannot go picking and choosing between concep- 
tual truths, expecting some of them to hold on all complete precisifi- 
cations, but not others. 

But fully honoring (P) is the undoing of supervaluational treat- 
ments of vagueness, according to Fodor and LePore. They argue 
that it trivializes such treatments by ruling out complete precisifica- 
tions. Indeed, it might well appear that they are onto something. To 
ensure that Fine’s penumbral truths are respected, we rule out all 
precisifications on which they fail to hold true. This is the effect of 
introducing penumbral constraints. But if we try to ensure respect 
for Fodor and LePore’s conceptual truths in the same way, it might 
seem, we will end up with no complete precisifications at all. Take 
(1) for example. How, one might wonder, can this sentence hold 
true on a complete precisification? Complete precisifications resolve 
all indeterminacy; they place every individual within either the post 
tive or the negative extension of every predicate. And where there is 
no indeterminacy, there is a fact of the matter about everything. So, 
one might think, on a complete precisification there is a fact of the 
matter about whether a man, one ninth of whose head is covered 
with hair, is bald. Since this is the very thing that (1) denies, it might 
seem that (1) cannot hold true on a complete precisification. 


I HOW TO HAVE TROTH-VALUE GAPS AND ALSO RESPECT CONCEPTUAL 
TRUTHS 


This reasoning should be resisted. Precisifications do resolve indeter- 
minacy, and complete precisifications resolve it completely. But to 
resolve indeterminacy is not to remove it, since where indeterminacy 
can be resolved in a range of different ways, it remains present in the 
multiplicity of different ways of resolving it. This idea is the basis of 
my criticism of Fodor and LePore’s argument. 
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Fodor and LePore’s conceptual truths assert that there is a certain 
indeterminacy. I argue that what is required, if they are to be re- 
spected, is variety among precisifications. For example, respecting 
(1) requires that there be two kinds of complete precisifications: 
some on which men one ninth of whose heads are covered with hair 
are included among the bald, and others on which such men are not 
included. On a proper understanding, I argue, respecting Fodor and 
LePore’s conceptual truths does not require ruling out precisifica- 
tions, in addition to those ruled out by Fine’s penumbral constraints. 
On the contrary, what it requires is ruling precisifications tm, to safe- 
guard variety. This completely undercuts Fodor and LePore’s objec- 
tion. As we shall see, honoring (P) is not merely compatible with 
truth-value gaps. It guarantees that there are some. 

Let us turn first to the sentential operator ‘it is indeterminate 
whether’. Fodor and LePore use it interchangeably with ‘there is no 
matter of fact about whether’; I shall soon use it to render their con- 
ceptual truths. Consider a sentence with this operator, say, ‘It is inde- 
terminate whether Al is bald’. This sentence is to hold true if Al is 
borderline bald, which is to say if some complete precisifications in- 
clude Al among the bald, while others do not. 

Notice that this is what might be called a global truth condition. An 
interpretation, comprising a space of precisifications, is the sort of 
thing that can satisfy it. An isolated precisification is not. One might 
think that, having global truth conditions, sentences with the opera- 
tor ‘it is indeterminate whether’ cannot be evaluated on the individ- 
ual precisifications of an interpretation. In this case, these sentences 
will somehow have to be evaluated without the super-truth condi 
tion. Maybe Fodor and LePore’s conceptual truths, rendered with 
this operator, can be treated in such a way: evaluated, indeed vali 
dated, but not supervalued. 

In fact, though, it is straightforward to evaluate the relevant sen- 
tences on individual precisifications. According to the following 
truth condition, a precisification that satisfies such a sentence does 
so in virtue of its place within a space of suitably various precisifica- 
tions: 

Truth condition for ‘Ut is indeterminata whether’. ‘It is indeterminate 
whether S’ is true on a complete precisification, in any given interpre- 
tation, if S 1s true on some complete precisifications of the interpreta- 
tion, but false on others. 


This truth condition ignores the higher-order vagueness that 
arises where borderlines themselves have borderlines. For example, 
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‘bald’ is vague in the second-, third-, and higher degrees if one 
might be a borderline case of a borderline bald man, a borderline 
case of a borderline case of a borderline bald man, and so on. To 
take higher-order vagueness into account, we could allow that ‘It is 
indeterminate whether Al is bald’ can be true on some ways of mak- 
ing ‘bald’ more precise (those in which, once all higher-order vague- 
ness has been resolved, Al lands in the borderline of ‘bald’) but false 
on others (in which he lands elsewhere). Clearly, my truth condition 
does not allow this. Higher-order vagueness does not seem to be rel- 
evant to this discussion, though, and I shall now put it aside. My crit- 
cism of Fodor and LePore’s argument can also be made, at greater 
length, while taking it into account. 

Given the restriction to first-order vagueness, I claim, this is the 
proper treatment of ‘it is indeterminate whether’.* Notice that Fodor 
and LePore’s version of supervaluation does not lend itself to this 
treatment. Once this operator is taken into account, truth on a com- 
plete precisification is a global matter, involving a space of precisi} 
cations. But, as I remarked in section I, Fodor and LePore seem to 
think that, on a supervaluational approach, vague language is to be 
interpreted in models that are, in effect, isolated precisifications. At 
this crucial point they oversimplify. 

Returning now to Fodor and LePore’s conceptual truths, we can 
take (1) as a representative example and render it as follows: 


For each x, if x is a man and one ninth of x's head is covered with 
hair, then it is indeterminate whether <x is bald. 


This illustrates my claim that, as seen from a supervaluational per- 
spective, Fodor and LePore’s conceptual truths hold true in virtue of 
the variety to be found among precisifications. Sentence (1) holds 
true (on one complete precisification of an interpretation, indeed 
on all such precisifications and, by the super-truth condition, in the 
interpretation itself) provided there are two kinds of complete pre- 
cisifications: some in which men one ninth of whose heads are cov- 
ered with hair fall within the positive extension of ‘bald’, and others 
in which they fall within the negative extension. This conceptual 


* Others agree. Dummett (op. ai, p. 311) and Fine (op. at, p. 289) both suggest 
a treatment of the sentential operator ‘it ıs definitely the case that’ which parallels 
the possible-worlds treatment of necessity: letting ‘D’ stand for this operator, a sen- 
tence DS will true (on any given precistication) just in case 5 holds true on all rele- 
vant precisifications. Fine (op. at, pp. 287-89) goes on to define in terms of Dan 
operator / that is equivalent to ‘it is indeterminate whether’. He gives an ingenious 
treatment allowing for higher-order vagueness (op. at, pp. 293-94, but watch for ty- 
pographical errors). Williamson repeats this treatment (op cit, p. 159). 
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truth is respected, in the sense of principle (P), provided all inter- 
pretations of English include precisifications of these two kinds. 

The problem with Fodor and LePore’s objection is now in plain 
view. They claim that honoring (P) rules out complete precisifica- 
tions, and with them truth-value gaps. But it is simply not so. Inter- 
pretations of English can respect their conceptual truths while 
having truth-value gaps. An interpretation that includes complete 
precisifications of the relevant two kinds respects (1), and leaves ‘Al 
is bald’ without a truth value: this sentence is true on precisifications 
of the first kind, in which Al falls within the positive extension of 
‘bald’; it is false on those of the second kind, in which he falls within 
the negative extension; so, by the super-truth condition, this sen- 
tence goes without a truth value. In fact, honoring (P) is not merely 
compatible with truth-value gaps. It guarantees that there are some, 
since if (1) is to hold true on each complete precisification of each 
interpretation of English, then each such interpretation will have to 
have complete precisifications of the relevant two kinds.’ There are 
sure to be truth-value gaps, given some named men with the right 
amount of hair. 

So Fodor and LePore are quite mistaken. Among other good 
things, supervaluation can leave truth-value gaps while validating the 
conceptual truths they have in mind. 

MICHAEL MORREAU 
University of Maryland/College Park 


” Strictly speaking, each sentrvial interpretation, having complete precisifica- 
tions, will have to have them. 
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NOTES AND NEWS 


The Editors regret the untimely death of the American philosopher Nelson 
Goodman on November 25, 1998, at the age of 92. He received his B.S. in 
1928 and his Ph.D. m 1941, both from Harvard University. Goodman was a 
Professor of Philosophy at Harvard University from 1968-1977. Previously he 
served as Professor of Philosophy at Brandeis University from 1964-1967, Pro- 
fessor from 1951-1964 and Associate Professor from 1946-1951 at the Univer- 
sity of Pennsylvania, and Instructor at Tufts College, 1944-1945. His books 
include A Study of Quahties (1941), The Structure of Appearance (1951), Fact, Fic- 
tion, and Forecast (1955), Languages of Art (1968), Problems and Projects (1972), 
Ways of Worldmaking (1978), Of Mind and Other Matters (1984), Reconceptons in 
Phtlosophy and Other Arts and Sciences, with Catherine Z. Elgin (1988), L’ Art en 
théorie et en achon (1996). Throughout his lifetime, Goodman was the recipi 
ent of many honors and awards. He served as President of the American 
Philosophical Association, Eastern Division, in 1967, and as Vice President of 
the Association for Symbolic Logic, 1950-1952. He was a Fellow of the Ameni- 
can Academy of Arts and Sciences and a corresponding Fellow of the British 
Academy. Among his many distinguished lectures and lecture series were: 
Alfred North Whitehead Lecture, Harvard University, 1962; John Locke Lec- 
tures, Oxford University, 1962; Miller Lectures, University of Illinois, 1974; 
Immanuel Kant Lectures, Stanford University, 1976; and the Howison Lec- 
ture, University of California, Berkeley, 1985. Goodman's activities related to 
art extended beyond philosophy. He was the founder, and director of Project 
Zero at Harvard, 1967-1971, an interdisciplinary research program investigat- 
ing aesthetic education. He was also producer of the Arts Orientation Series, 
1969-1971, and Director of the Dance Center, both at Harvard. The Philoso- 
phy Department at Harvard is reportedly planning a colloquium in his 
honor, possibly in the spring or fall of 1999. 
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N N ee 
“Whatever is, is one...whatever are, itemany- A 


—Bertrand Russell (1908)! 


ill and Hillary live together in Washington. Bill is one human 
Be So is Hillary. But they are two humans; they are differ- 

ent humans. What is it for them to be two humans? It is for 
them to instantiate a property, namely, being two humans. This is 
the view that I aim to clarify, defend, and develop in this paper. In 
doing so, I reject the standard conception of property (and relation) 
and defend a broader conception that allows nonstandard proper- 
ties like the one in question. The standard conception retains the 
long-standing bias against the many in favor of the one; the concep- 
tion contains principles to the effect that, roughly, many things as 
such. cannot instantiate a property. The broader conception results 
from rejecting those principles. 

I begin by clarifying the notion of the many in section I. In section 
i, I formulate and argue against the crucial principles of the stam 
dard conception. In section m, I clarify the language used to at- 
tribute nonstandard properties to the many. Then I elaborate on the 
broader conception, with the aid of the resources available in the 
language, in section Iv. Section V is an epilogue that places the con- 
ception in a historical perspective. 


* J am grateful to Joseph Almog, Tyler Burge, Kit Fine, David Kaplan, Bernard 
Linsky, Penelope Maddy, Tony Marun, and Graham Priest for their help at various 
stages of writing this paper. My work on this paper was partially supported by a Kib 
lam Postdoctoral Fellowship at the University of Alberta and a University of 
Queensland Postdoctoral Research Fellowship, which are hereby gratefully ac- 
knowledged. 

! Prnaples of Mathematics (New York: Norton, 1937, 2nd edition), p. 132. 
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I. MANY VERSUS ONE 

Here is the contrast between what is one and what are many. Bill is 
one. He is some one thing, specifically, one human being. Likewise 
with Hillary. But Bul and Hillary are many things, namely, more than 
one thing. They are, specifically, two humans. Moreover, the two hu- 
mans are many things only, not some one thing. By contrast, Bill is 
one thing only, not many things. He (or his body) Aas many parts: 
one heart, two eyes, four limbs, and the like. But this does not make 
Bill himself the many parts, any more than his having two parents 
makes him the two parents. Surely, the contrast does not rest on the 
fact that Bill and Hillary are both humans. Anything, any object, is 
one thing. It is irrelevant to this what kind of thing it is, whether it is 
an adult or a child, a human or a city, material or immaterial, spa- 
tiotemporal or not, concrete or abstract, and so on. Likewise, Bill 
and Hillary are two simply because they are not identical with each 
other. Whether they are similar or not, how closely related they are, 
and so on are immaterial to their being many. 

Some might object that the contrast somewhat breaks down be- 
cause there are complex objects, such as sets or classes, groups or col- 
lections, aggregates or mereological sums, or the so-called totalities 
or pluralities. They agree that such an object, too, is one thing, but 
object that it is many things as well. This is not correct. Consider, 
for example, the set {Bill, Hillary}, an object that in some sense 
comprehends both Bill and Hillary (but nothing else). The set is 
two, the objectors argue, because it ts (namely, is identical with) 
the two humans (though it is not either of them). But this assump- 
tion does not hold. Bill and Hillary are two humans; by contrast, 
the set is not two humans.” The set in a sense Aas two humans; it is 
related to them as its members. But this does not make the set itself 
two humans; it does so no more than Bill’s having two Americans 
as parents makes Bill himself two Americans.’ Thus a complex ob- 
ject, too, is one thing only, not many things. How many members, 
elements, or parts it has is as irrelevant to this as is how many par- 
ents a human has. 


t One might object that one cannot get ‘{Bill, Hillary} 1 two humans’ from ‘Bull 
and Hillary are two humans’ by replacing ‘Bill and Hillary’ with ‘{Bill, Hillary)’. Al- 
though I think the sentences have the same predicate ‘to be two humans’, which 
takes different forms, we can bypass this issue by considenng a contrast immune to 
the objecnon: Bll and Hillary, and Chelsea are three humans; {Bui Hillery} and 
Chelsea are not three humans (they are only two things: one set and one human) 

’ In Parts of Classes (Cambridge. Blackwell, 1991), David Lewis holds that the 
mereological sum of Bill and Hillary 1s in a sense identical with the two humans (so 
it is in a sense two humans). I have argued against this view in “Is Mereology Onto- 
logically Innocent?” Philosophical Studies, XC, 2 (February 1999): 141-60. 
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Some might respond by replacing, in effect, the talk of being two 
(or being many) with the talk of related properties that complex ob- 
jects instantiate, such as being two-membered, a property instantiated by 
any set (or class or whatever has members) with exactly two mem- 
bers. One cannot identify this latter property with the former, they 
concede; being two-membered is instantiated by the set {Bill, 
Hillary}, which it is not correct to say instantiates being two. Sull, 
they argue, one can hold that being two-membered is the real pred- 
icative component of facts like the following: 


(1) Bill and Hillary are two. 


That is, on their view, the sentence ‘Bill and Hillary are two’ states 
the same fact that the sentence ‘{Bill, Hillary} is two-membered’ 
states, that is, the fact that 


(2) {Bul, Hillary} is two-membered. 


but the former sentence is misleading about the nature of the fact 
because there is no property indicated by the apparent predicate ‘to 
be two’.* 

This view rests on an incorrect analysis of facts like (1). To see this, 
consider facts like the following: 


(3) Bull is not identical with Hillary [and Bill is a thing and Hillary is 
a thing]. 
Bill and Hillary must be two different things, insofar as Bill is not 
identical with Hillary (while Bill exists as a thing and so does 


Hillary). So: 
Thesis [: Fact (1) is a logical consequence of fact (3). 


But just because Bill is not identical with Hillary (while both Bill and 
Hillary exist) does not mean: 


t For an account of (natural) number based on this analysis, see Penelope 
, Realism m Mathematics (New York: Oxford, 1990). For example, she identi- 
fies the number two as the property of being two-membered. Fre accounts 
that identify numbers with a hand of objects (for example, (5, {@}}) can also be 
seen to be based on analymng facts like (1) as facts like (2) that contain relakonal 
properties characterized with reference to the objects (for example, being equinu- 
merous with {@, {@}}). For Frege’s account, which differs from the contemporary 
Fregean accounts in identifying numbers as extensions of concepts, see, for exam- 
ple, his Dis Grundlagen der Artihmenk (Breslau: Kébner, 1884), translated by J. L. 
Austin as The Foundations of Arithmetic (New York: Blackwell, 1950). In A System of 
Logic (New York: Harper, 1891, 8th edition), J. S. Mull identifies, for example, two 
as a property (having two parts) instanuated by an aggregate. This account, too, 
must rest on a simular analysis of facts like (1). 
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(4) There is something (for example, a set) of which Bill is a mem- 
ber and Hillary is a member. 


Thus, fact (4) is nota logical consequence of fact (3). Because, how- 
ever, fact (4) isa logical consequence of fact (2): 


Thesis II: Fact (2) is nota logical consequence of fact (3). 


Now, it follows from theses I and I that fact (1) cannot be reduced 
to fact (2). 

Underlying this argument is the thought that fact (1) is a fact 
which concerns both Bill and Hillary but which concerns nothing 
else; Bill and Hillary are two (consequently, many) things simply be- 
cause they are things not identical with each other. The fact that Bill 
is identical with himself does not concern the singleton {Bill}, which 
is not Bill, because the fact concerns nothing but Bill. Likewise, fact 
(1) does not concern the set {Bill, Hillary}, which is neither Bill nor 
Hillary, no matter how closely it might be related to both Bill and 
Hillary. It is the same with any complex object, be it the aggregate of 
the humans or the so-called plurality thereof. Thus, fact (1) cannot 
be analyzed as involving a complex object. 

The proper analysis of facts like (1), I shall argue, is the one im- 
plicit in the above distinction between the one and the many. Fact 
(1) is composed of (a) Bil and Hillary, on the one hand, and (b) be- 
ing two, on the other; its predicative component, being two, is a prop- 
erty instantiated by many things (for example, Bill and Hillary) but 
not by any one of them. Similarly, the fact that Bill and Hillary are 
many is formed by being many, a property instantiated by no object, 
simple or complex. Accepting this analysis requires a radical depar- 
ture from the standard conception of property and relation. 

I BEING TWO AS A PLURAL PROPERTY 
On the standard conception of property and relation, there is no 
property like being two. I shall now explain two related principles in 


* One might object to this by identifying logical consequence with modal entail 
ment, defined via necessity (namely, maæapkyscal necessity, as ıt 3 commonly called). 
On the orthodox view on the modal status of sets, fact (3) modally entails fact (4); see, 
for example, Kit Fine, “First-order Modal Theories: Sets,” Mots, xv (1981): 177-205. 
This, however, gives no relief for the analysis in question. The fact that Bill is a human 
and the fact that {Bill} Aas exactly one member that ts a human modally entail each other, 
on the orthodox view, but the two facts cannot be identified, because the property of 
having xadh one meander that ıs a kuman cannot be taken to be the predicative compo- 
nent of the former fact. Thus, it is necessary to draw a distinchon among facts finer 
than the one available through modal entailment. My view is that logical conse- 
quence, among others, yields such a distinction. For more on this or sss, 
sec Fine, “Essence and Modality,” Philosophacal Perspectrues, Vit. (1994): 1-16; and Joseph 
Almog, “The What and the How,” this JOURNAL, LXXXVII, 5 (May 1991): 225-44. 
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the conception which rule out such properties, and argue that facts 
like (1) cannot be given a proper analysis without rejecting the prin- 
ciples. 

HI. The prineple of singular instantiahon. On the standard concep- 
tion, a property cannot be instantiated by many things as such; there 
is no such property as being two, which would be instantiated by two 
things as such. For a better understanding of reasons for rejecting 
such properties, it is necessary to be clearer about the content of the 
phrase ‘as such’. The phrase is necessary because the conception ac- 
commodates some properties that are in a sense instantiated by 
many things: for example, beng a human, which is in a sense instant- 
ated by, for example, Bill and Hillary. What is the difference be- 
tween the standard properties like this and those ruled out by the 
conception? 

One might attempt to explain it as follows. Being a human, instan- 
tiated (in a sense) by Bill and Hillary, is instantiated by every one of 
them; being two, by contrast, is not instantiated by every one of 
those, by whom it is supposed to be instantiated. But this explana- 
tion does not get at the heart of the matter. Being one or more humans 
would be instantiated by any one of those who instantiate it; either of 
Bill and Hillary, who are one or more humans (specifically, two hu- 
mans), is one or more humans (specifically, one human). But the 
standard conception rules out such a property as well. This is due to 
an important feature which it shares with, for example, being two 
and distinguishes them from the standard properties. 

Being a human must occur more than once to be instantiated by Bill 
and Hillary—whether to form two different facts, such as the fact 
that Bill is a human and the fact that Hillary is a human, or as differ- 
ent components of one complex fact, such as the fact that: 


Bill ts a Auman [the first occurrence], and Hillary is a Auman [the sec- 
ond occurrence]. 


By contrast, being two can (or could) occur just once to be instanti- 
ated by many things; it should be taken to occur only once to be in- 
stantiated by the two humans and, thus, form fact (1). It is the same 
with being one or more humans; this property can (or could) occur 
just once to be instantiated by Bill and Hillary—though it can also oc- 
cur two more times to be instantiated by Bill and Hillary, respectively. 
Thus, the standard conception contains the following principle: 
The principle of singular instantiation: there 1s no property that can 


occur just once to be instantiated by many things (that is, more than 
one thing). 
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II.2. The principle of singularity. The principle of singular instantia- 
tion is, I think, a consequence of an underlying principle that for- 
mulates the more general constraint that the standard conception 
imposes on relations as well as properties. To state this principle, it 
is useful to clarify some basics about objects, properties, and rela- 
tions. 

Bill, for example, is an object; being a human, a property; living m, 
a relation. There is a categorical difference between objects and 
properties or relations. Very roughly, properties or relations are pred- 
icable, objects are not. Being a human is predicated of Bill (specifi- 
cally, Bill instantiates the property); Bill is a Auman. Similarly, living in 
is predicated of Bill and Washington, so to speak, in the given order 
(specifically, Bill bears the relation to Washington); Bill Aves in Wash- 
ington. Bill, by contrast, cannot be predicated at all. He occurs in 
the fact that Bill is Bul, but not as a predicable; he occurs in it to 
bear the relation of identity, indicated® by ‘is’ in the sentence ‘Bill is 
Bill’, to himself.’ 

Properties are distinguished from relations by the number of argu- 
ment places. A property has one argument place; a relation, two or 
more. Being a human has only one argument place; it can be filled 
in with Bill to yield the fact that Bill is a human. Living in has two ar- 
gument places; they can be filled in with Bill (the first) and Washing- 
ton (the second), respectively, to yield the fact that Bill lives in 
Washington. 

The distinction between properties and relations, we can see, is 
not as fundamental as the one between objects and properties or re- 
lations. The difference between a property and a two-place relation 
is no greater than the one between the relation and a three-place re- 
lation, such as the one indicated by the predicate ‘to give...to’. So I 
shall henceforth use the term ‘relation’ in a broader sense to apply it 
to properties as well: properties are one-place relations, relations 
with one argument place. 

Now, consider what can fill in argument places of relations. The 
argument place of being a human can be filled in with (in short, ad- 
mits of) Bill because he instantiates the property. In general, what- 
ever instantiates a property can fill in its argument place. But the 
converse does not hold. The argument place of a property admits of 


tI use ‘indicate’ widely for the relation between any linguistic expression and its 
metaphysical counterpart, and reserve ‘refer’ for the more specific relation applic- 
able to nonpredicable expressions. 

’ There is also the property of berng Bill (composed of identity and Bill), but Bill 
differs from the property. 
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whatever instantiates its complement as well, because a property and 
its complement have the same kind of argument places. London, for 
example, can fill in the argument place of being a human, whose 
complement, the property of not being a human, is instantiated by 
the city. 

If so, what cannot fill in the argument place of a relation? The ar- 
gument place of being a human does not admit of-many things as 
such, It (in a sense) admits of Bill and Hillary, but for this to happen, 
the property must occur twice; its argument place admits of only one 
thing (that is, Bill) as the property occurs just once. And it is the same 
with relations like living in. The first argument place of this relation 
(in a sense) admits of Bill and Hillary to yield the fact that: 


Bill and Hillary kve in Washington. 


But the relation must occur twice for this to happen, as can be seen 
in the following analysis of the fact: 


Bill Aves en Washington, and Hillary ves in Washington. 


Now, on the standard conception, this is the general situation with 
any argument place of any relation whatsoever, that is: 


The principle of singulanty: an argument place of a relation does not 
admit of many things as the relation occurs ust once. 


We can see that this explains why, on the conception, the principle 
of singular instantiation holds; a property cannot occur just once to 
be instantiated by many things unless its argument place admits of 
them as the property occurs just once. 

H.3. The ground for plural relations. I shall now argue that the princi- 
ple of singularity does not hold. Given the argument, it is straightfor- 
ward to see that the principle of singular instantiation does not hold, 
either. 

Call an argument place of a relation plural if it admits of many ob- 
jects as such; singular otherwise. Accordingly, call a relation singular 
if all of its argument places are singular; plural otherwise. On the 
principle of singularity, there is no plural relation. I argue that there 
are plural relations because there are facts like the following: 


Bill and Hillary are two. (Fact (1)) 

Bul and Hillary are two humans. 

Bill and Hillary fue together. 

Russell and Whitehead coauthor a book. 

Some humans love only one another. 

The coauthors of Principia Mathematica (in short, PM ) cooperate. 
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Bill is not one of the coauthors of PM. 


The proper analysis of such facts, I claim, involve plural properties 
like: 

being two 

being two humans 

living together 

cooperating 

loving only one another 
and two-place relations like: 


coauthoring 
being one of 


whose first and second argument places, respectively, are plural. 

My argument for the claim runs as follows. (A) Being two, for ex- 
ample, is the predicative component of fact (1) (so it is a relation), 
indicated by the predicate ‘to be two’, that it shares with facts like 
the following: 


(5a) Dolly and Washington are two. 
(5b) Washington and Neptune ars two. 


(B) And it has only one argument place (so it is a property), which 
admits of Bill and Hillary to yield fact (1). (C) Finally, the argument 
place is a plural one because the property occurs only once in, for 
example, fact (1), in which the argument place is filled in with many 
things, for example, Bill and Hillary. 

One might essentially agree with (A) but object that the predica 
tive component of fact (1) is the singular two-place relation indi- 
cated by the phrase ‘... and ——— are two’.® This view might seem 
plausible if one considers only facts like (1) or (5a), but it fails to 
square with facts like the following: 


(6a) Bill, Hillary, and Washington are not two. 
(6b) Bull, Hillary, Washington, and Neptune are not two. 


These facts cannot be seen to contain a singular two-place relation. 
Their common predicative component, the relation of not beng two, 
must be taken to have only one argument place that admits of Bill, 
Hillary, and Washington (as such) to yield fact (6a); and Bill, Hillary, 
Washington, and Neptune (as such) to yield fact (6b). That is, not 


* For an account of number that rests on this view, see John Bigelow, The Reahty 
of Numbers (New York: Oxford, 1988), pp. 49-54. (I have added ‘essentially’ because 
he might object that the phrase, strictly speaking, does not indicate being two.) 


IS TWO A PROPERTY? 171 


being two is a plural property. Now, because this is the complement 
of the predicative component of fact (1), being two, this must also 
be a plural property. 

Some might concede that being two is not a two-place relation, 
one with two argument places built into it, but object that it is a singu- 
lar relation of a special kind, one that Henry Leonard and Nelson 
Goodman’ call a multigrade relation. On this view, being two (or its 
complement) is a relation that can occur in one fact with, for exam- 
ple, two argument places filled in while occurring in another fact 
with another number of argument places filled in. But this view 
meets difficulty with facts like the following: 


(7) There are some things that are not two. 


which shares the complement of being two with facts like (6a). How 
many, and what kind of, argument places does the relation occur 
with in fact (7)? The right answer to this question is: it occurs with 
only one argument place, a plural one. Then there is no reason to 
take the relation to occur with other, singular argument places in, 
for example, fact (6a). 

How else can one object to the above argument for plural proper- 
ties? It is hard to reject (C) while granting (A)-(B). One cannot ar- 
gue that being two occurs twice in fact (1), which cannot be analyzed 
as a conjunctive fact. Thus, to reject (C), one would have to argue 
that Bill and Hillary are in a sense one complex object, but this, we 
have seen, is not a correct view. Some might object that there is no 
fact corresponding to the construction “Bill and Hillary are two’, be- 
cause this is not grammatically correct. For, on their view, ‘two’ used 
as an adjective must be complemented with nouns (or noun 
phrases). But this view does not help much in defending the princi 
ple of singularity. One can apply the above argument to, for exam- 
ple, the fact that: 


(8) Bull and Hillary are two humans. 


and conclude that being two humans is a plural property. Because 
they cannot challenge this argument with the grammatical objec- 
tion, they might attempt to do so by defending alternative analyses 
of fact (8). But the analyses they might propose can be seen to result 


* See their “The Calculus of Individuals and Its Uses,” Journal of Symbolic Logic, V, 
2 (June 1940); 45-55. The linguistic counterparts of multigrade relations are 
called vanably predscates. For more discussions of the views examined ın this 
h and the previous, see my “Numbers and Relations,” Evkesntus, XLIX, 1 

(1998): 93-118 
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from different ways of analyzing fact (1), because fact (8) can be ana- 
lyzed as containing fact (1) as a conjunct as follows: 


Bill and Hillary are two, and they are humans. 


Gan one then defend a way to analyze facts like (1) or (8) without 
plural relations? One might hold that such facts contain only singu- 
lar properties instantiated by complex objects. On her view, fact (1) 
can be analyzed as follows: 


(2) {Bill, Hillary} is twoamembered. 


which results from the set {Biull, Hillary} filling in the argument place 
of the singular property of being twoamembered. But such an analy- 
sis cannot be right, we have seen, because fact (1) concerns nothing 
(that is, no object) but Bill or Hillary. 

To avoid this problem, one might attempt to analyze such facts in 
terms of properties rather than complex objects. They, one might ar- 
gue, are second-order facts like the one composed of (i) the singular 
(first-order) property of being identical urth Bul or Hillary and (ii) a 
second-order property that somehow corresponds to ‘two’, one that 
is instantiated by any singular property instantiated by two things.’ 
For one can argue (correctly, on my view) that this fact does not logi 
cally entail the existence of any object except Bill or Hillary; the exis- 
tence of a property, which it logically entails, does not logically entail 
the existence of an object that one might use as its representative. 

Because properties are not determined by their instances, how- 
ever, the scheme of second-order analysis meets difficulties in pre- 
serving logical relations among facts like the following: 


(9) Russell and Whitehead cooperate. 
(10) Russell and Whitehead are the authors of PM 
(11) The authors of PM cooperate. 


Consider the second-order facts that the scheme renders them as: 


(9a) cooperate (is identical with Russell or Whitehead)" 

(10a) Vala is identical with Russell or Whitehead © a is an author of 
PM] 

(lla) coopersts (18 an author of PM) 


* This is the kind of analysis that would be proposed by, for example, Michael 
Dummett, who argues that “a plural noun-phrase...under a correct analyzls...1s seen 
to figure predicatively"—Prege Philosophy of Mathematics (Cambridge: Harvard, 
1991), p. 98. 

j The sentence used here is a second-order predication whose predicate 1s 'œwop- 
erate’, whose argument place is filled in with the predicate ‘is identical with Russell 
or Whitehead’. 
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Although the properties indicated by the phrases ‘is identical with 
Russell or Whitehead’ and ‘is an author of PM’ are co-extensional 
(so, fact (10a) holds), they are not the same property; to be identical 
with Russell or Whitehead is not the same as to be an author of 
PM. If so, one of the properties may instantiate a second-order 
property—for example, the one indicated by ‘cooperate’—that the 
other does not. Thus, (11a) is not a logical consequence of (9a) and 
(10a).!* But, clearly, (11) is a logical consequence of (9) and (10). 
To avoid this problem, one might analyze facts like (9) not as 
second-order predications but as second-order existential generaliza- 
tions like: 
(9b) JX[Va(X(a) + a 1s identical with Russell or Whitehead) A 
cooperate (X) |" 


But this modified scheme fails other logical relations. Consider the 
following: 


(12) Russell and Whitehead are philosophers who coauthor PM. 
(13) There are some philosophers that cooperate and coauthor PM. 


The scheme renders fact (12) as follows: 


(12a) AX[Va(X(a@) + a is identical with Russell or Whitehead) A 
[Wa(X(a) > a isa philosopher) A cowmthor (X, PM)]] 


This fact and fact (9b) do not logically entail the second-order coun- 
terpart of fact (13): 


* One cannot defend the scheme by adopting the extensional conception of 
relation (which is implausible given properties hke the above-mentioned) that 
identifies co-extensional relations. (lla) is not a logical consequence of (9a) 
and (10a) unless the “identity” between the first-order properties is a logical 
consequence of their extensional equivalence. But it is one thing to hold the 
extensional conception; quite another to hold (more implausibly) that its truth 
relies on logic alone. 

D One might analyze (9) as the fact that Russell cooperates-with Whitehead, and 
Whitehead cooperates-with Russell. This analysis is not applicable to fact (11), 
which has the same predicative component. One can avoid this problem by analyz- 

facts like (9) and (11) ın terms of the second property indicated by ‘Vaf.. a .. 
+ AB (atp A a cooperateswith 8)]', whose argument place (indicated by *...a...’) 
admits of a first-order property (for example, being identical with Russell or White- 
head). But this analysis fails because pairwise cooperation of, for example, three 
humans does not logically entail their cooperation: any two of them may cooperate 
(for example, to write a book between them while excluding the other, or to kill 
the other) while not all three of them cooperate. 

4 This 1s the key idea in Ruzesell’s “No-Class Theory,” on which the talk of sets (or 
“classes”) is reducible to the talk of properties (or “propomtional functions”); see, 
for ie his Introduction to Mathematical Philosophy (London: Allen and Unwin, 
1919), p 187f. 
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(18a) AX[Va(X(a) > a is a philosopher) A [coopenats (X) A coauthor 
(X, PM)]] 


But, clearly, facts (9) and (12) logically entail fact (13). 

Now, some people might concede that fact (9) contains a plural 
property (and thus, reject the principle of singularity) but still ob- 
ject that fact (1) should be analyzed as a second-order fact whose 
predicative component is a special kind of second-order property, 
the one that corresponds to the quantifier phrase ‘Saif (a#+ß A 
vyl... y... © (y=a V y=f)])’. On their view, the phrase ‘Russell 
and Whitehead’ functions ambiguously in ‘Russell and Whitehead 
are two’ and ‘Russell and Whitehead cooperate’; and the predica- 
tive components of the facts stated by them cannot combine to 
yield a complex property, because they do not have the same kind 
of argument places. But there is nothing wrong with contracting 
the two sentences as if the phrase functions unambiguously 
therein, and the resulting sentence states a logical consequence of 
the two facts: 


Russell and Whitehead are two and cooperate. 
which logically entails the fact that: 
There are some things that are two and cooperate. 


Being two, like cooperating, must be seen to occur as a plural prop- 
erty in this fact—as a component of the complex property: being two 
and cooperating. Then facts like (1), too, must be analyzed in terms of 
the plural property given their connection to the more complex 
facts like the above. 

This completes my argument against the principle of singularity. It 
should be clear from the argument that the principle of singular in- 
stantiation must also be rejected. Not every property that violates the 
former principle violates the latter principle; being one has an argu- 
ment place that admits of, for example, Bill and Hillary as such, but 
is not instantiated by them as such. But the properties mentioned in 
the argument are counterexamples to the latter principle as well. Be- 


* One might connder analyzing, for example, fact (9) as follows: 


3X(Va[ X (a) + a is identical with Russell or Whitehead] A 
VY [Va(X(a) + Y(a)) > coopocets (Y)}) 


This relies on the assumption that the property indicated by ‘cocpeness’ is instanti- 
ated by any property co-extensional with a property that instantiates it. But one 
cannot assume this without analyzing the second-order property. For more on sec- 
ond-order treatments, see subsection II 2.C below. 
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ing two, for example, is instantiated by any two things as such; its 
complement, by any things more than two. 

Notice also that this last point establishes that the argument place of 
being two admits of any two or more things as such—no matter how 
many they are, whether they are finitely many or infinitely so. More- 
over, the argument place, like that of being one, admits of any one 
thing—for example, Bilt—as well. For any one thing instantiates the 
property of not being two because there are facts like the following: 


Bill ts not two [in particular, not two humans]. 
Bill ts not iwo, but only one [namely, one human]. 


M THE LANGUAGE AND LOGIC OF PLURALS" 

In the next section, I shall elaborate on the nature of plural relations 
and confirm the intuitions adduced against alternative analyses of 
facts like (1), such as the intuition that fact (1) is a logical conse- 
quence of fact (3). I prepare for the task here by clarifying the syn- 
tax, semantics, and logic of the language I shall use for that purpose. 
The language contains plural, as well as singular, constructions, 
which allows one to use it to refer to many things as such and indi- 
cate and analyze plural, as well as singular, relations. We are familiar 
with the language. It is English (or its relevant fragment). 

HY. 1. Informal syntax and semantics. English has: singular terms, such 
as the proper name ‘Bill’, the definite description ‘the author of Aca- 
demica’, or the pronoun ‘it’; singular predicates, predicates all of whose 
argument places are stngular (namely, they admit of only a singular 
term), such as ‘to be a human’; and singular quantifiers, such as 
‘something’ (or “There is something that’). In addition, it has: plural 
terms, such as the conjunctive plural term ‘Bill and Hillary’, the 
plural definite description ‘the authors of PM’, or the plural pro- 
noun ‘they’; plural predicates, predicates some of whose argument 
places are plural (that is, they admit of a plural term’), such as one- 
place predicates like ‘to be two humans’ or ‘to cooperate’, or two- 


* Infinitely many things (for example, the natural numbers) cannot (as such) be 
indicated by a plural term resulting from joining their names using ‘and’, but the 
absence of a term no more makes the things unsuitable for the plural prop- 
erty than the absence of a term referring to a pebble makes the pebble unsuitable 
for a angular property. 

” For elaborations of discussions in this section, see my “The Language and 
Logic of Plurals,” Synthese (forthcoming). 

Plural argument places may admit of a singular term as well; recall, for exam- 
ple, ‘to be two’. (One might distinguish them further into neutral and exclusroely 
ones.) Most Enghsh predicates, on my view, have such an argument place; 
Cicero writes Academica’ and ‘Russell and Whitehead wnte PM’ have the same 
predicate 
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place predicates like ‘to be one of’, ‘to write’, or ‘to carry’; and plural 
quantifiers, such as ‘some things’ (or “There are some things that’). 
Thus, the language contains sentences like ‘Bill and Hillary are two 
humans’, ‘Russell is one of the authors of PM’, and “There are some 
things that are two humans, and they cooperate and write a book’, as 
well as sentences like ‘Bill is a human’, ‘Cicero is the author of Acade- 
mica’, and “There is something that is a human, and he writes a 
book’. 

Plural terms must be sharply distinguished from (first-order) pred- 
icates. These are predicable expreasions that can combine with non- 
predicable expressions to form a sentence. The predicate ‘to be a 
human’ combines with the singular term ‘Bull’ to yield the sentence 
‘Bill is a human’. By contrast, plural terms, like singular terms, are 
nonpredicable expressions. ‘The plural term “Bill and Hillary’ cannot 
combine with ‘Bill’ to yield a sentence, just as the singular term ‘Bill’ 
cannot combine with ‘Bill’ to yield a sentence. Nonpredicable ex- 
pressions can constitute a sentence—for example, ‘Bill is one of Bill 
and Hillary’ or ‘Bill is Bill’—~only by combining with predicable ex- 
pressions—for example, ‘to be one of’ or ‘to be’. 

Accordingly, different kinds of semantic relations are appropriate 
for plural terms, on the one hand, and predicates, on the other. The 
relation of reference is appropriate for nonpredicable expressions; be- 
ing true of, for predicables (of one argument place). The predicate 
‘to be a human’ (or ‘to be one of Bill and Hillary’) is true of Bill, but 
does not refer to him; the singular term ‘Bill’, by contrast, refers to 
Bill, but is not true of him. How about plural terms? They are refer- 
ential expressions. The plural term ‘Bill and Hillary’, for example, is 
not true of anything (or any things). It refers to some things, 
namely, Bill and Hillary (as such).™ This does not mean that the 
plural term refers to Bill and also to Hillary; it refers to neither of 
them. A typical”! plural term refers to some things without referring 
to any one of them. 

We can characterize the truth of plural, as well as singular, predi- 
cations in terms of the semantic functions of terms and predicates. 


” There 1s another semantic function of predicables: they indicate relations. We 
need to consider thus function to characterize the truth of sentences with, for ex- 
ample, second-order predicates (except first-order quantifiers), but the fragment 
of English here in question contains no such predicate. 

SS lal cael agli ba nag aa lente AA OER 
tion; likewise with ‘to be true of" as used below to be applicable to p predi- 
cates. 

" This qualification is necessary because some plural terms refer to only one 

‘Cicero and Tully’ is a plural term, but refers to only one thing: Cicero, that 
is, Cicero and Tully, who are not many. 
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The singular predication “Bill is a human’ is true if its predicate is 
true of Bill, to whom the term ‘Bill’ refers. To give a similar account 
of plural predications like ‘Bill and Hillary are two’, we need to con- 
sider the semantics of plural predicates like ‘to be two’. One-place 
plural predicates can be true of some things: the predicate ‘to be two 
humans’ is true of any two humans—for example, Bill and Hillary; 
‘to be two’, any two different things—for example, Washington and 
Dolly; ‘to coauthor a book’, any writers that write a book together— 
for example, Russell and Whitehead; ‘to be one’, any one thing—for 
example, Cicero. Now, the plural predication ‘Bill and Hillary are 
two’ is true if its predicate ‘to be two’ is true of Bill and Hillary, to 
whom the term ‘Bill and Hillary’ refers. We can extend this account 
to define the satisfaction and truth of sentences in the fragment of 
English in question, using plural constructions in the metalanguage, 
but it is not necessary to do so for the present purpose. 

II.2. Regimentahon and logic. To give a proper treatment of logical 
relations among sentences in the fragment of English that contains 
plural constructions, it is useful to regiment the fragment. Regiment- 
ing facilitates formulation of logical relations among sentences in it 
by imposing austere syntax. Let me call the regimented fragment of 
English the language of plurals, and the logic that pertains to sen- 
tences in the language (and, thus, their English counterparts) the 
logic of plurals. The language extends elementary language, the regi- 
mented language developed in tandem with elementary logic, by in- 
cluding (counterparts of) plural constructions; accordingly, the logic 
of plurals extends elementary logic to do justice to logical relations 
involving plural constructions. 

Ht. 2.A, The language of plurals. The primitive expressions of the lan- 
guage of plurals are: 


I. Primitrve Terms 
Singular Constants: for example, ‘Bill’ (‘b’), “Hillary” (‘h’) 
Singular Venables: for example, ‘a’, ‘P’ 
Ptural Vanables for example, ‘as’, ‘Bs? 
IL Term Connective: ‘@' (“AND’) 
I. Predicates 
Singular Predicates 
logical ‘=’ (“is-identicabwith’) 
nonlogicak for example, ‘is-a-human’ (‘H’), ‘admires’ (‘A’), ‘is 
a-member-of” (‘e’) 


= Plural variables correspond to plural pronouns as used anaphorically in sen- 
tences like "There are some humans, and they cooperate’ I put ‘s’ to make the vari- 
ables look like nouns of English in plural forms, but the variables are mmple symbols. 
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Phural Predicates 
logat ‘H’ (‘IS-ONE-OF) 
nonlogicak for example, ‘COOPERATE’, ‘WRITE’, ‘SING- 
TOGETHER’ 
IV. Sentential Connectives: ‘~’ (it-is-not-the-casethat’), ‘A’ (‘and’) 
V. Quantifiers 
The stngular eastential quantifier T’ (‘thereis-something...such-that’) 
The plural castenthal quantifier’) (‘there-are-some-things...suchthat’) 
The language officially contains only the existential quantifiers and 
two sentential connectives. But the universal quantifiers ‘V’ (‘any- 
thing...is-such-that’) and ‘TI’ (‘Anythings...are-such-that’) and other 
sentential connectives that I shall use can be introduced via the 
usual definitions. 

It is not necessary to elaborate on the rules of formation except 
for some clarifications. First, the plural argument places of predi- 
cates admit of any singular, as well as plural, term. Thus ‘Bill IS- 
ONE-OF Bill’ is grammatically correct, though the second 
argument place of its predicate is plural.” Second, the term con- 
nective ‘@’ accepts any two (occurrences of) terms to form a com- 
plex plural term. Thus ‘[Cicero@Tully]’ and ‘[Cicero@Cicero]’ are 
plural terms. Third, the first argument place of the plural (or sin- 
gular) quantifier, marked by ‘...’, admits of only a plural (or singu- 
lar) variable. 

In addition to the predicate ‘=’, familiar from elementary 
logic, the language of plurals has another logical predicate: ‘IS- 
ONE-OF’. This predicate, as a regimentation of the predicate ‘to 
be one of’ in English, counts as a logical predicate for the same 
reason as the predicate ‘=’ does so: some logical relations rely on 
the specific nature of the predicate (or its English counterpart). 
For example, ‘Cicero ts one of Cicero and Caesar’ is a logical 
truth; ‘Cicero ts one of Tully and Caesar’ and ‘Cicero ts one of the 
Roman philosophers’ are logically equivalent with ‘Cicero is Tully 
or Caesar’ and ‘Cicero is a Roman philosopher’, respectively. The 
logical relations can be lost if the predicate is replaced with an- 
other. 


” Its first argument place is singular. One might object that the English predi 
cate accepts only a plural term ın its second argument place. If so, we can define a 
new predicate ‘is-one-of’ with the old predicate: 


sirone-of t=, Vals is one of [t AND a]], where tis any term 


and introduce the counterpart of the new predicate as a primitive in the regi 
mented language. 
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Given the predicate, we can define a plural predicate, whose +th 
argument place is plural, on the basis of a predicate @ whose Hh ar- 
gument place is singular as follows: 


PO (hs by, AS boes la) ma Jp[H (9, as) > P (bso bi p, bte 
4) ], where 4,..., 41, 44... ¢, are terms appropriate for the relevant ar- 
gument places of ¢@ and £ occurs free in none of ¢ and 4,..., 41, 


T 


Call the predicate ¢™ (or simply ¢” if ¢ is one-place) the (+h) 
plural expansion of the base predicate @. Sentences like ‘Bill and 
Hillary are humans’ and ‘They are humans’ can be seen to be 
formed by the plural expansion ‘is-a-human” (in symbols, ‘H”’) of 
the singular predicate ‘isa-human’. We can then explain the logi- 
cal equivalence between, for example, the plural predication ‘Bill 
and Hillary are humans’ and the sentential conjunction ‘Bill is a 
human, and Hillary is a human’ as resulting from the definition of 
the plural expansion; the plural expansion is true of any things 
(for example, Bill and Hillary) each of which the singular base is 
true of. 

Now, plural definite descriptions like ‘the humans’ (in symbols, 
‘(zw) H?(w)’) can be seen to result from applying a description oper- 
ator to a plural expansion. This description, for example, can be de- 
fined in contexts in which it occurs primarily as follows: 


fl (cw) A"(w)] m, Las [VA (H(A, as) + H(B)) A dlasfl™ 


H1.2.B. The logic of plurals. I shall now present logical truths in the 
language of plurals in an axiomatic way.” To do so, it is convenient 
to use schematic letters for different kinds of expressions (or sets 
thereof) in the language: ‘s’ for singular terms; ‘t’ and ‘w for any 
terms; ‘v’ and ‘w’ for singular and plural variables, respectively; ‘o’ 
and ‘p’ for sentences; and ‘I’ for sets of sentences. (They are all 
used with or without numerical subscripts.) Say that ¢ is a plural unt 
versal generalization of w if and only if either ¢= or P='Q,u,Q,ty 


™ Ott] is used for sentence ¢ in which term t occurs primarily There 1s an- 
other kind of plural definite descriptions—for example, ‘the coauthors of PM’ or 
‘the things that something is a human if, and only if, ıt is one of them’ (which 
‘the humans’ can be taken to abbreviate). Such descriptions, too, can be 
phrased into the language of plurals, though somewhat differently “The coauthors 
of PM cooperate’ can be paraphrased by “There are some things that are coauthors 
of PM, they are the same things as any things that are coauthors of PM, and they 
ee te’ 
= Although I shall also talk of other logical relations (for example, logical impl+ 
cation), such talks are reducible to talks of logical truth ın the usual way 
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..Q,u,i/', where Qi, Q,,.... Q, are the singular or plural universal 
quantifiers whose first argument places admit of the respective vari- 
ables t4, ty... u,.™ Then the axioms of the system are the plural uni 
versal generalizations of instances of the following schemas:”’ 


Group A 
Truth-functional tautologies 
h(s) + dud(v), where sis substitutable for vin ġ(v) 
Vulp(v) > #(v)] > [Vve (v) > Vog(v)] 
$ > Vud, where v does not occur free in d 
s=s, > [ġ > ¢,], where ġ is an atomic sentence and œ, is ob- 
tained from ¢ by replacing sin some of its occurrences by $ 


Group B 
VI. (d > Lwd(w), where tis substitutable for win (w) 
VI. = Unlo(w) > #(a)] > [Ilwd(a) > wd (w) ] 
VIL d—IIwd, where w does not occur free in d 


<AHEN 


Group C 
IX. H(s, [@u]) + [H(s ) V H(s u)] 
x Hiss)” 5 
XL. JAvH(v, w) 


D 
XU. Vv[H(v, ġ e H(v, u)] > [ġ > ¢,], where v does not occur free 
in tor u d is an atomic sentence, and @, is obtained from ¢ by 
replacing ¢in some of its occurrences by u 
XI. dAvd(v) > LwVv[H(v, w) + d(v)], where w does not occur free 
in (v) 


The schemas of Group A are familiar from elementary logic.* Those 
of Group B are plural counterparts of schemas U-IV. Group C contains 
schemas that concern mostly the logic of the predicate ‘H’. The rest 
are contained in Group D. 

We can do with one rule of inference, modus ponens: from œ and ! [@ 
—> p] |, we may infer y. There are logical truths of the language we 
cannot obtain by applying the rule repeatedly to the axioms. How 
can we then distinguish such logical truths from its sentences that 
are not logically truer I do not give a full answer to this question 


= Recall that the universal quantifiers are defined as follows: 


Vod(v) my, ~Ju~p(0); wp (m) my, ~La~$(m) 


v The definitions of ‘t is substitutable for u in @(«)’ and ‘t occurs free in œ’ 
(where tand «are variables) are straightforward. 

™ They are taken from Herbert Enderton, A Mathematical Introduction to Logic 
(New York: Academic, 1972), pp. 104£f. (The schema ‘s=s’ is derivable grven V and 
X-XL.) 
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here, because the logic of the language cannot be given a complete 
axiomatization.™ For present purposes, it suffices to give a partial an- 
swer: the logic is a conservative extension of elementary logic, that is: 


Conservation principle: a sentence ¢ that belongs to an elementary 


language (or the singular fragment of the language of plurals) is logt 
cally true if and only fit is logically true on elementary logic. 


Thus, sentences like ‘b&h- Jalea A hea)’ or ‘[Vy(A(g y) > y=) 
AVY(A(t y) > y=e)] > da[eea A ka)’ are not logically true even in 
the language of plurals; they are not so on elementary logic. 

ITT. 2.C. Remarks on the logic. The conservation principle helps to ar- 
gue against the prevalent view that to assert plural quantifications 
like the following, known as the Geach-Kaplan sentence, is to com- 
mit one to the existence of sets (or other complex objects) :” 


(14) There are some critics who admire only one another (in symbols, 
SasvVBLH(G, as) + CA) AVy(AG, Y) > B#y A H(y, a5)))). 


The view rests on paraphrasing such sentences into singular quantifi- 
cations like the following: 


(15) There is something (for example, a set) every member of which 
is a critic and admires only its other member (in symbols, 


da (ABfpea A VB[ Bea > CB) AVy (ACB, Y) + B#y A yea))). 


But this is not a correct paraphrase; (14) and (15) are not logically 
equivalent. To see this, consider the following: 


(16) Ezra is a critic, Thomas is a critic, Ezra is not identical with 
Thomas, Ezra admires only Thomas, and Thomas admires only 
Ezra (in symbols, [C(e) A C(t)] A eFt A [Vy(Ala y) > Y=DA 
Vy(A(t, y) > y=a)]). 


* The language contains sentences like ‘Some positive natural numbers are succes- 
sors of only one another’, which amounts to the negation of the full induction - 
ciple that helps to yield a categorical characterization of natural numbers. The logic 
of languages that contam the logical resources of the language presented above and 
a predicate (for example, ‘is a successor of or ‘admires’), which helps to 
state simulacrum of a sufficient fragment of the theory of fimite sets, cannot be 
given a full axiomatic characterivation. For a full model-theoretic characterization of 
the logic, see my “The Language and Logic of Plurals,” section IV. 

™ For this view, see W. V. Quine, Methods of Loge (London: Routledge, 1972, 3rd 
edition), p. 239, and his The Roots of Reference (La Salle, IL’ Open Court, 1973), p. 111. 
See also Michael D. Resnik, “Second-order Logic Sill Wild,” this JOURNAL, LXXXV, 2 
(February 1988): 75-87; and Allen P. Hazen, “Against Pturalsm,” Australanan fowrnal 
of Philosophy, LXXI, 2 (June 1993): 132-44 For other, less direct criticsms of the view, 
see George Boolos, “To Be Is to Be a Value of a Vartable (or to Be a Value of Some 
Vartable),” this JOURNAL, LXXXI, 8 (August 1984): 430-49, and his “Nomunalist Platon- 
um,” Phulosophscal Review, XLIV, 3 (July 1985): 327-44; see also Lewis, Parts of Classes 
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(16) does not logically imply (15), by the conservation principle, be- 
cause it does not do so on elementary logic. But (16) logically im- 
ples the following, which logically implies (14), by schema VI: 


(17) Ezra and Thomas are critics who admire only each other (in sym- 
bols, VA[H(B, [e@d]) > C) A YYW, y) > BF yA Hy, 
[e@i]))]). 


We can see some important features of the logic of plurals 
through schema VI, which we can use to show that (17) logically im- 
plies (14). Sentence (14) can be seen to result from replacing the 
italicized plural terms in the following sentences with the plural 
quantifier: 


Exra and Thomas are critics who admire only each other (sentence 
(17)). 

Ezra, Thomas, and Bill are critics who admire only one another. 

Erra, Thomas, Bill, and Hillary are critics who admire only one 
another. 


Thus, by the schema, any of these sentences logically implies (14). 
Other instances of its antecedent include the following (or their 
paraphrases): 


John and Carol are i 
Russell, Moore, and Whitehead are famous humans who are three 


philosophers. 
Bul, Bons, Tony, and Howard are pohticians who cooperate. 


Thus, we can use the schema to show that they logically imply the 
following, respectively: 


Notice that sentences like the following, too, are instances of the 
schema, because the schematic letter ‘t’ can be replaced with a sin- 
gular term as well: 


If Cicero is a Roman who is a philosopher, some Romans are philoso- 


phers (in symbols, [R?(c) A P?(c)] > Las[R?(as) A P?(as)]).™ 


” Thus, ‘Las{R*(as) A PP(as)]’, ‘Sal Ra) A PP(a)]’, and “dal R{a) A Pla)’, 
which can be taken to paraphrase ‘Some Romans are philosophers’, are logically 
equivalent 
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Some might object that these are not logically true, because ‘Some 
Romans are philosophers’ implies the existence of at least two Ro- 
mans. But this is not a correct view. The plural quantification can- 
not logically imply ‘There are at least two Roman philosophers’; it 
is logically implied by, for example, ‘Cicero and Tully are Romans 
who are philosophers’, which is logically implied by ‘Cicero is a 
Roman who is a philosopher, and Tully is a Roman who is a 
philosopher’. 

Schemas XI-XII help to distinguish the logic of plurals from sec- 
ond-order logic. Schema XII is the plural counterpart of schema V, 
which states the principle of substitutivity of identity (for singular 
terms). Schema XII states the principle that, if some things are the 
same things as some things and the former are so-and-so, then the 
latter, too, must be so-and-so; its antecedent gives the condition for 
some things (for example, Russell and Whitehead) to be the same 
things as some things (for example, the authors of PM). Thus, we 
can use the schema to show that sentences like the following are log- 
ically true: 

If Russell and Whitehead are the authors of PM, and Russell and 
Whitehead cooperate, then the authors of PM cooperate. 


If the children who carry a piano are John and Carol, they carry it if 
and only if John and Carol carry it. 


Consider schema XII. The following sentences, for example, are 
instances of its antecedent and consequent, respectively: 


(18)There is a philosopher who writes something (in symbols, 
da[P(a) A AB Wa, 8)]). 

(19) There are some things such that something is one of them if and 
only if it is a philosopher who writes something (in symbols, 
Lysva(H(a, ys) + [P(a) A 38 Wa, §)]))). 


Thus, by the schema, (18) logically implies (19). Using the schema, 
we can also show that “There is a politician in Washington’, for ex- 
ample, logically implies “There are some politicians in Washington 
such that any politician in Washington is one of them’; this sentence 
is logically equivalent with the consequent of its appropriate in- 
stance. 

Notice that sentence (19) can be abbreviated as follows: 


(19a) There are the philosophers each of whom writes something. 


for if some things are such that something is one of them if and only if 
it is a philosopher who writes something, they are the philosophers 
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each of whom writes something (and vice versa). In general, the com- 
sequent of schema XIII can be abbreviated as ‘Lu, [w= (uow), 
where the predicate ‘=’ is defined as follows: 


imum, VulH(v, i) + H(u, u)], where v does not occur free in tor u 


Thus, we can take the schema to state a principle giving a logically 
sufficient condition for there to be some things (for example, the 
philosophers each of whom writes something): if there is some- 
thing that is so-and-so, there are the things each one of whom is so- 
and-so. Notice that the necessary condition is also sufficient; the 
converse of the schema holds as well, which we can show using 
schema XI: 


XIV. LwVv[H(v, w) + ġ(v)] > dvd(v), where w does not occur free 
in ġ(v) 


Thus, the antecedent of XUI is essential to the schema; we cannot do 
without the antecedent to get a stronger schema of logic. Most in- 
stances of its consequent are not logical truths; sentence (19) is nota 
logical truth. Moreover, the instances include logical falsities, such as 
‘LasV B[H (6, as) e B#]’, which logically implies ‘ABB#/’. 

One might attempt to preserve schema XIII by treating plural 
terms and quantifiers as predicates and second-order quantifiers, re- 
spectively, because instances of the following schema, the second-or- 
der counterpart of XIII, are logically true: 


Xa. SvX(v) > SXVv[ X(v) e 6(v)], where X does not occur free 
in ġ 

But this is not sufficient to reduce the logic of plurals to second- 
order logic. The converse of XMa, that is, the second-order coun- 
terpart of XIV, does not hold: ‘AXVa[X(a) © a#a]’, which is 
logically implied by ‘Vala#a + a#a]’ on second-order logic, is 
true; but ‘Jaata’ is false. We cannot derive the converse of Xia, 
notice, because the second-order counterpart of schema XI 
(namely, ‘dvX(v)’) does not hold; ‘daa¥#a’ is logically false. More- 
over, notice that second-order logic contains a schema stronger 
than XMa: 


Xb. AX Vv X(v) + o(v)], where X does not occur free in d 


which we can use to get the negation of ‘V Xia X(a)’, a second-order 
universal generalization of an instance of the schema ‘dvX(v)’. Thus, 


* To see this, apply the definition of plural definite descriptions. 
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one cannot take plural quantifiers as second-order quantifiers in an 
alternative notation.” 

It is usual to circumvent this well-recognized discrepancy between 
the logic of plural quantifiers and the logic of second-order quant- 
fiers by paraphrasing plural quantifiers like ‘Some things are such 
that...” into second-order notation using restricted quantifier 
phrases like ‘3JX[JaX(a) A ...]’. This scheme of paraphrase is objec- 
tionable on syntactic grounds, because applying the scheme to sen- 
tences like ‘John and Carol carry a piano’ or ‘A book is written by 
Russell and Whitehead’ fail to preserve the fact that they share 
predicates with sentences like ‘John carries a piano’ or ‘A book is 
written by Cicero’. Still, some might defend the scheme by para- 
phrasing ‘A book is written by Cicero’, for example, by the second- 
order sentence ‘J8(B(8) A AX[daX(a) A Va(X(a) © a=c) A 
weite( X, #)])’. They might argue that this suffices to account for the 
logic of plurals, if not their syntax, because, for example, this para- 
phrase, like their paraphrase of ‘A book is written by Russell and 
Whitehead’, logically implies their paraphrase of ‘A book is written 
by some things’. 

But the second-order scheme cannot preserve the logical relations 
among plurals encapsulated in schema XII. Consider arguments like 
the following, whose logical validity is captured by the schema: 


(20) Russell and Whitehead cooperate. 
(21) Russell and Whitehead are the authors of PM. 
Therefore, 


(22) The authors of PM cooperate. 


To account for their logical validity by applying the scheme to sen- 
tences like (20) and (22), it is necessary to appeal to the second- 
order counterpart of schema XII: 


Xa. Vol X(v) + Y(v)] > [ġ > $], where X and Y are second-order 
variables, ¢ an atomic sentence, and $, obtained from @ by re- 
placing Xin some of 1ts occurrences by Y 


” Neither can second-order languages, governed by second-order logic, be taken 
to provide alternative notations for plural constructions—pace Boolos, who pro- 
poses to interpret the formulas in second-order “languages” by rendernng them 
into the language of plurals. He reads the second-order quantifier “1X” as ‘there 
are some things such that...them...’, and argues that this leads to a defense of the 
logical status of X][b—see his “Reading the Begriffsschrift” Mind, xav, 375 (July 
1985): 831-44, here p. 340. But the plural reading falls short of justifying XOIb, 
whose instances, on that reading, include logical falsities. 
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But this schema cannot be considered a schema of logical truths in 
languages that contain second-order predicates like ‘cooperate’, which 
one needs to obtain the suggested paraphrases of sentences like 
(20).% Let the second-order predicate ‘of indicate the relation of 
“identity” between properties, and the first-order predicates ‘A’ and 
‘B the property of being identical with Russell or Whitehead and 
that of being an author of PM, respectively. Then ‘of(A, B)’ is false, 
though both ‘of(A, A)’ and ‘Va[A(a) ++ B(a)]’ are true. Other 
second-order predicates give rise to the same problem: because the 
extensional equivalence between, for example, the properties indi 
cated by the predicates ‘A’ and ‘B’ does not logically imply their 
“identity,” the predications that attribute, for example, the relation 
indicated by ‘cooperate’ to the properties are not logically equivalent 
even granting their extensional equivalence. 

To defend schema XIIa, one might argue that second-order predi- 
cates generate Opaque contexts. Schema XIa cannot be taken to 
warrant substituting ‘is Tully’ (salva verntate) for the co-extensional 
predicate ‘is an author of Academica’ in the following sentence: 


(23) Bill thinks that Cicero is an author of Academica. 


just as schema VI does not warrant substituting ‘Tully’ for ‘Cicero’ in 
the sentence. But it is hard to appeal to restrictions like this to de- 
fend schema XMa. Sentence (23) cannot be analyzed as a predica- 
tion composed of ‘Cicero’ and a first-order predicate (or ‘is an 
author of Academica’ and a second-order predicate) because ‘Cicero’ 


* In Plurals and Events (Cambndge: MIT, 1993), Barry Schein presents a second- 
order scheme that does without such predicates. The scheme renders, for exam- 
ple, ‘they cooperate’ to a vt at quantification over events 'Je[ sis a cooperation 
event A Va (A (e a) © T(a@))]’, where ‘T’ 1s a second-order vanable corresponding 
to the pronoun ‘they’ and ‘A(s a)’ amounts to, roughly, ‘a is an agent of e. This 
scheme is subject to a version of the so-called Russell’s paradox. The scheme ren- 
ders ‘Xs are as many as Y's’ to “de[ AMA(#) A Va(A(e a) e Xla)) A YVa(O(6 a) e 
Y{a))]’, where 'AMA(e)’ and ‘O(s, a)’ amount to, roughly, ‘e is an event whose 
agents are as many as its objects’ and ‘a is an object of ¢’, respectively. Thus, Schein 
must take the following, which amounts to the true sentence ‘Any things are as 
many as themselves’, to hold: 


WX(SaX(a) > Je[AMA(s) A Va (A (e a) + X(a)) A Va(O(¢ a) + X(a))] 


This leads to a contradiction, assuming “ta~AMA(a)’. Its instance obtained by the 
predicate ‘F’ defined as follows leads to a contradicnon, because ‘daF(a)’ holds: 


A a A AMA(B) A Wa(A(f, a) + X(a)) A Va(O(B, a) + 
a 


‘LAMA(s8) A Va(A(e, a) + Fla)) A Va(O(e, a) + F(a)))]’, which implies 
‘WX[Va(A(e, a) + X(a)) > Va(X(a) + Aa))]’, implies [HÀ + ~F{a)]’ 
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(or ‘is an author of Academica’) occurs in an opaque context, gener- 
ated by ‘thinks’, in (23); but the second-order sentences considered 
in the above paragraph are all second-order predications. Some peo- 
ple might perhaps argue that opaque contexts are more prevalent, 
that they are generated by all second-order predicates (except first- 
order quantifiers),* but this defense renders the schema useless in 
accounting for the logical relation in question among sentences like 
(20)-(22). 
IV. VARIETIES OF PLURAL RELATIONS 

I shall now discuss various kinds of plural relations, with concentra- 
tion on plural properties. In particular, I shall analyze a special kind 
of plural properties, the pure numerical properties, in terms of logi- 
cal resources only, and confirm the intuitions to which I have ap- 
pealed. 

Being two, recall, is instantiated by some things, but not by any- 
thing. It is the same with most of the plural properties discussed 
above. It is useful to focus on such properties in arguing against the 
principle of singularity. But, of course, there are other kinds of 
plural properties. Being one (or being one human), for example, is 
instantiated by something, for example, Bill, but not by any two or 
more things (as such). And there are properties instantiated by some 
one thing as well as by some two or more things (as such): for exam- 
ple, not being two, or wnting a book, a property instantiated by Cicero 
and also by Russell and Whitehead (as such). Finally, some plural 
properties are not instantiated at all; as the singular property of be- 
ing a Martian philosopher is not instantiated by anything, the plural 
property of being two Martian philosophers is not instantiated by 
any one or more things. 

Next, recall the plural expansions like ‘isa-human’”’ (in symbols, 
'H®™) of singular one-place predicates like ‘is-a-human’. The plural 
expansions indicate plural properties that I call the plural expansions 
of their singular bases, the properties indicated by the singular predi 
cates. For example, the property that I call beng human[s], indicated 
by ‘is-a-human”’, is the plural expansion of being a human. Now, any 
plural expansion is distributive, that is, it is instantiated by some 


™ Some case can be made of this—‘the author of Academica’ cannot be substi 
tuted for ‘Cicero’ in ‘gf (is Cicero, is Cicero)’ (Notice, however, that the language 
of plurals officially contains no definite descripuons.) But the problem with 
Schema XIa anses primarily because ıt differs from the correct formulaton of the 
principle of substitutivity for predicates: 


A(X, Y) > [$ > $], where ¢’ ıs obtained from ¢ by replacing Xin some 
places by Y 
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things as such if and only if it is instantiated by every one of them.™ 
We can see this by considering their analysis in terms of their singu- 
lar bases. The following definition of the predicate ‘being-a- 
human’’, for example, can be seen to give an analysis of the 
property of being human [s]: 


H” (as) m., VIHA, as) + A(6)] 


That is, for some things to be human[s] is for them to be such that 
every one of them is a human. Thus, the plural property is instanti 
ated by, for example, Bill and Hillary (as such) if and only if it is in- 
stantiated by Bill and by Hillary; for ‘H?({b@A])’ and ‘HP(b) A 
H*(h)’ are logically equivalent. But this logical equivalence relies on 
the nature of the plural expansion; sentences of the forms ‘¢@(t@u)’ 
and ‘@(t) A @(u)’ are not in general logically equivalent. Accord- 
ingly, most of the other plural properties, such as being two, are 
nondistributive.*’ 

Let me now focus on a special kind of plural properties: numerical 
properties. I distinguish two kinds of them: impure ones like being 
two humans or being two-membered; and pure ones like being two. I 
shall analyze the impure ones in terms of the pure ones, and the 
pure ones in terms of logical resources only. 

Being two humans, for example, can be analyzed in terms of (i) 
being two and (ii) being humans[s] according to the following deft 
nition: 

BE-TWO-HUMANS (as) ™,, BE-TWO(as) A A¥(as) 


Thus, for some things to be two humans is for them to be two and 
also humans. The analysis separates the numerical component of the 
impure numerical property from its non-numerical component. This 
clarifies that Bill and Hillary’s being of the same kind is irrelevant to 
their being two. 

Similarly, we can distil the numerical component of being two- 
membered, for example, by analyzing it as follows: 


istwo-membered(a) ™,, Tfs(BE-TWO(6s) A V6[H(6, Bs) + dea) 


For something to be two-membered is, roughly, for it to have as its 
members some things that are two. This analysis conforms to, and 
clarifies, the idea that, for example, the set {Bill, Hillary} instanti- 


*T use ‘some things’, and so on, to mean the same as ‘some one or more things’, 
and so on. Thus, the above sentence implies ‘it is instantiated by something if and 
only if it is instantiated by every one of it’. 

P There are exceptions like beng at least one. 
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ates being two-membered because Bill and Hillary as such instanti- 
ate being two, not vice versa. The relation of membership, indi- 
cated by the predicate ‘c’, plays only the subsidiary role of relating 
the set to the people in question, its members. Thus, the relation, 
too, is extrinsic to the numerical feature proper of being two- 
membered. 

It is the same with other impure numerical properties like having 
one offspring, having two parts, or having three elements. They in- 
herit their numerical character from the pure numerical properties 
that underlie them as their common components. 

Next, we can analyze the properties of being one and being two as 
follows: 


BE-ONE (as) =, 3AVE[H (8, as) + b=] 
BE-TWO (as) ™,, Apay(BAy A V6(H(6, as) e =p V &=y]) 


It is straightforward to give similar analyses of the other properties 
corresponding to natural numbers (greater than one).® 

Now, we can confirm the logical equivalence of facts like (1) and 
(3), which can be stated by the following sentences: 


(la) BE-TWO ([b@A]) 

(3a) b#A A [ABB=b A ABB=A] 
These sentences are logically equivalent, given the definition of the 
predicate ‘BE-TWO’, because the following is a logical truth: 


VéTH(5, [b@A]) + 5=5 V 5=h]™ 


We can also confirm that the existence of, for example, the set {Bill, 
Hillary} is not a logical consequence of Bill and Hillary's being two. 
For sentence (la) does not logically imply the singular construction 
‘da[bea A hea]’, because, as we can see using the conservation prin- 
ciple, sentence (3a) does not do so. 

Finally, notice that the definition of, for example, the predicate 
‘BE-TWO’ can be modified as follows, using only bounded quant+ 
fiers: 


BE-TWO(as) ™,, 3A (H(A, as) A dy[H(y, as) A BEY A VE(HE, as) 
> [H(4, as) + 6=8 V 6=y])]) 


* Or we can analyze them recursively in terms of their respective predecessors: 


BE-(N+1) (as) =, SfLys(BE-M ys) A H(f, as) A VELH(S, ys)  H(6, as) A 
6%) 


* We can show this using schemas IX-X. 
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This helps to confirm and clarify the idea that instantiating a pure 
numerical property is a matter intrinsic to the things in question (for 
example, Bill and Hillary); whether or not there is any other object 
(for example, the set {Bill, Hillary}) is irrelevant to their being two. 
V. EPILOGUE 

The standard conception of relation is the natural ally of the 
Fregean conception of language and logic. The Fregean language 
extends the Aristotelian language by including, among others, predi- 
cates with more than one argument place; and the Fregean logic ex- 
tends the Aristotelian logic to the extended language. The 
development of the Fregean conception, which helped to make 
clear the limitations of the Aristotelian conception of language and 
logic, led to the establishment of the standard conception, which im- 
proves on the Aristotelian conception of relation that does not allow 
the existence of relations even in the usual sense. Similarly, we can 
regiment plural constructions and formulate their logic as in section 
m, which helps to clarify the limitations of the Fregean conception 
and replace the standard conception with a broader conception that 
allows plural relations. Such work, however, does no more than help 
to remove blindfolds, without which we can directly see the limita- 
tions of the standard conception. It is, after all, two things that are 
two. 

BYEONG-UK YI 
University of Queensland 


2 ls my view, natural numbers are the above-mentioned nume roperties. 
my Understanding the Many (Ph. D. Dissertation, University of Gildea 
rena 1995), chapter 4; and “Numbers and Relations.” 
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COMMENTS AND CRITICISM 


WINGS, SPOONS, PILLS, AND QUILLS: 
A PLURALIST THEORY OF FUNCTION 


eth Preston’s “Why Is a Wing Like a Spoon; A Pluralist Theory 
B: Function”' contains misunderstandings of my theory of 

proper functions and of its relation to the notion she calls sys- 
tem functions. I should like to clear some of this misunderstanding 
away, and at the same time show why system function is not a deter- 
minate notion and certainly cannot do the job Preston wants it to do 
in defining artifact functions. 

After citing my Language, Thought, and Other Biological Categories’ 
(and only that), Preston sums up “function theorists” generally, re- 
marking, “it is altogether common to assume that a theory of artifact 
function will simply fall out of the general theory of function now be- 
ing articulated, even though this purportedly general theory of func- 
tion is based on an examination of biological function” (216). This 
comes immediately after remarking on my treatment in LTOBC that 
“[t] he one difference she notes [between tools and other things with 
functions] is that the proper functions of biological devices, lan- 
guage devices, and representational mental states are not the result 
of purposive design, as are the proper functions of tools” (215). 
Now, I thought that that particular difference between artifacts and, 
for example, body organs, was apparently a huge difference, indeed, 
a seemingly insurmountable difference. Moreover, I believe a 
unique feature of my discussion of proper function in LTOBC was 
the care with which I tried to articulate the notions of derived and 
adapted functions (chapter 2) in an attempt to show exactly how, de- 
spite first appearances, human purposes themselves have proper 
functions, and how a result of this is, first, that all human actions 
have proper functions and, then, that artifacts have proper func- 
tions. The attempt to show exactly how such things as artifacts could 
be subsumed under the same notion of function as that for language 
devices and biological devices was one of the central and most diff 
cult tasks of LTOBC. Indeed, Preston engages in an extended cri 
tique of my way of doing this, clearly belying her claim that “(t]he 
nature of artifacts generally, and the nature of their functionality in 
particular, is taken to be so transparently obvious...that virtually no- 


! This JOURNAL, XCV, 5 (May 1998): 215-54. 
* Cambridge: MIT, 1984; hereafter LTOBC. 
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body has bothered to examine it at any length” (215). Daniel Den 
nett, of course, is another who has examined the nature of artifacts 
at some length.’ 

Nor was my “purportedly general theory of function...based on an 
examination of biological function.” Let me quote from the intro- 
duction and chapter one of LTOBC: 


Proper function is intended as a technical term. It is of interest be- 
cause it can be used to unravel certain problems, not because it does or 
doesn’t accord with common notions such as “purpose” or the ordi 
nary notion “function.” My program is far removed from conceptual 
analysis; I need a term that will do a certain job and so I must fashion 
one (op. cit, p. 2). 


I need [the definition of “proper function”] in order to talk about analo- 
gies and disanalogies among things belonging to quite diverse cate- 
gories—body organs, tools, purposive behaviors, language elements, 
inner representations, animal's signals, customs, etc....the purpose being 
to make as explicit as possible analogies among categories of things, 
which analogies had struck me as useful to reflect on...the spirit in which 
I offer [the definition] to the reader is as a handle by which to grab hold 
of the analogies.... If the analogies are really there, it will undoubtedly 
take time and an interchange of ideas before they can be explicated en- 
tirely adequately (op. at, p. 38). 


From these passages, and also from the definition I gave, and from 
my examples which were mainly of language functions in the early 
chapters of LTOBG, I thought it would be clear that the term ‘bio- 
logical’ in my title was used not literally, but broadly or metaphor? 
cally. It should be noted as well that the compound term ‘proper 
function’ was my own coinage. I intended (as suggested at LTOBC, 
page 2) Webster’s Dictionary’s first meaning of ‘proper’, which coin- 
cides with that of the Latin propium, meaning one’s own.‘ Certainly, 
nothing “judgmental” was meant here (224), and if proper functions 
are capacities a thing “ought to have” (224), then remember that the 
sky can also look “like it ought to rain tomorrow.” Normative terms 
are not always evaluative, but can indicate any kind of measure from 
which actual departures are possible. For example, a numerical aver- 


* See, for example, “The Interpretation of Texts, People, and Other Artifacts,” 
and Phenomenological Research, supplement to Volume v (Fall 1990): 177- 
94. See also his “Preston on Exaptation: Herons, Apples, and Eggs,” this JOURNAL, 
xcv, 11 (November 1998): 576-80. 
‘The root is found, for example, in ‘property’ and in the verb ‘to appropriate’. 
‘Functioning properly’ is, of course, a perfectly ordinary English phrase, but there 
‘proper’ tends toward an evaluative meaning. 
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age is also a “norm.” (These last two points are trifling in the context 
of Preston’s essay. I mention them only because other commentators 
have misunderstood me in the same ways, resulting sometimes in 
serious confusions. Clearly, I should have been more explicit be- 
fore.) 

Preston’s first serious error results from the assimilation, under 
her term ‘system functions’, of the sort of functions Robert Cum- 
mins had in mind in his classic paper “Functional Analysis ”* to 
Stephen Jay Gould and E. S. Vrba’s* “exaptations.” The categories 
“exaptation” and “adaptation,” as defined by Gould and Vrba, are ex- 
clusive. If a thing has been selected for because it has had a certain 
effect, that effect is an “adaptation” and is therefore not an “exapta- 
tion.” Cummins functions, on the other hand, clearly include func- 
tions that a system was selected for performing: “Functional analysis 
in biology is essentially similar. The biologically significant capacities 
of an entire organism are explained by analyzing the organism into a 
number of ‘systems’——the circulatory system, the digestive system, the 
nervous system, etc.—each of which has its characteristic capacities” 
(op. cit, p. 760-61, quoted by Preston, 220). Obviously, many of the 
capacities of these systems were selected for and Cummins is well 
aware of that. But it is not because these capacities were selected for 
that he calls them “functions.” 

Having conflated Cummins functions with exaptations by calling 
both “system functions,” Preston now claims that “Millikan’s pluralist 
tolerance for system function should translate into a similar toler- 
ance for exaptation, but in fact she is very critical of this notion, ar- 
guing that the vast majority of the phenomena taken to be 
exaptations are actually adaptations, that is, the functions in ques- 
tion are actually proper functions rather than system functions” (228). 
But, of course, the claim that something is an adaptation rather than 
an exaptation has nothing to do with the question whether it has a 
Cummins function. All adaptations have (or once had) Cummins 
functions when they are given analyses in the context of appropriate 
biological systems. Thus, I remain a staunch pluralist. There are (at 
minimum) two kinds of functions, Cummins functions and proper 
functions, and lots of functions belong to both these kinds. 

Cummins functions are dispositions of parts of some designated 
containing system, or simpler dispositions of the whole system which, 
added together, account for some complex capacity one wants to ex- 


* This JOURNAL, LXXI, 20 (November 20, 1975): 741-65. 
**Exaptation: A Missing Term in the Science of Form,” Paleotsology, vii (1982): 
4-15 
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plain. Exaptations are supposed to be Cummins-style functions 
where (1) the chosen capacity to be explained is how the system con- 
tributes to survival and proliferation of a particular species of organ- 
ism, and (2) the capacity also happens not to be an adaptation. The 
crucial question for a theory of exaptations is not, as Preston sup- 
poses, how often and how soon exaptations become adaptations, but 
whether “contributes to the survival and proliferation of a species” is 
a well-defined notion.’ 

In describing the general form that functional analyses take, Cum- 
mins himself mentions flow charts, circuit diagrams, and computer 
programs. The most common uses we make of these items, however, 
is to describe how a system is designed, that is, quite literally, what 
someone designed it to do. Designed systems come with all kinds of 
specifications (both explicit and implicit) attached. They come with 
assumptions about what to count as parts of the system versus parts 
of the outside world or things accidentally dropped into the system; 
about what to count as appropriate background conditions in which 
the system is to operate; about what to count as appropriate input to 
and output from the system; about what to count as “state changes” 
within the system as opposed to damage, breakdowns, or weardowns; 
about what to count as the system itself having a certain capacity ver- 
sus some interference from outside the system’s being partly respon- 
sible for something it does; and so forth. Unfortunately, members of 
living species do not come with directions telling what is relevant to 
proper operation of the system rather than irrelevant or external to 
it. To be an exaptation, however, a thing must first of all be a part of 
the system under analysis. How do we determine what is to count as 
part of the biological system? 

Let me first parody. Cats have an interesting capacity to walk 
which it would be interesting to have analyzed. But anaesthetized 
cats have exactly the same capacity; indeed, dead cats do, if strung 
up as puppets. If only the capacity to walk is in question, it is not 
logic that rules out attached puppet strings as suitable conditions for 
function and forces exerted on these strings as “input” while ruling 
in a surrounding atmosphere containing the right amount of oxy- 
gen, the right force of gravity and a surface underfoot that affords 
enough friction. To define a biological system, it obviously is not 
enough to indicate a chunk of matter and an interesting capacity 
that it has. Which among the many ways that various members of the 
species might be maintained are relevant to a biological system? An 
obvious beginning answer is that the appropriate conditions and in- 


1 See my White Queen Psychology (Cambridge: MIT, 1993), chapter two. 
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put must characterize or have characterized various actual members 
of the species. Which actual members? 

Given conditions that obtain in a modern hospital, the sounds 
that one’s heart makes can easily become an operative part of a set 
of sufficient conditions leading to quick diagnosis and medical atten- 
tion for a life-threatening heart ailment, producing helpful “inputs” 
to one’s system via needles and pills. Does this make these sounds 
exaptations? Similarly, if the heart emits a healthy sound, this may 
function immediately to focus medical attention somewhere else, 
failing which one’s life might be lost. Do the conditions and inputs 
associated with modern hospitals count as appropriate ones in which 
to consider the species-maintaining functions of parts of human bod- 
ies? Have enough humans survived yet due to kidney machines to ad- 
mit their presence and interference as allowable conditions for the 
human biological system to run in? How about penicillin injections 
or pills? Compare these conditions and inputs, for example, with 
what loving parents put into human infants. A carefully structured 
environment and proper input is necessary for the continued nor- 
mal functioning of many biological systems. On what principled line 
then do we distinguish between ‘within’ and ‘without’ for the 
species-maintaining Cummins system. Hence in what principled way 
do we define “exaptation”? 

Nor is this sort of difficulty in delimiting the biological system's 
boundaries tied just to purposive interactions with such systems. Por- 
cupines are occasionally saved from breaking bones when they fall 
out of trees (something which they do astonishingly often) by the 
springiness of their quills. Should we include this helpful disposition 
of their quills as an exaptation? Porcupines are especially fond of 
pine-tree bark, hence when they fall out of trees, it is often on soft 
pine needles. Is their love for pine bark an adaptation to prevent 
harm when falling? Should we also include as exaptations the dispo- 
sitions toward religion that sometimes result in bibles in breast pock- 
ets that stop stray bullets that would otherwise have killed soldiers? 
How frequently do such events have to occur in the life of a porcu 
pine or a soldier to be exaptations? Why exactly that frequentlye 

Flow charts, circuit diagrams, and programs typically describe sys- 
tems that are purposefully maintained by humans in conditions 
which will not relevantly alter the materials of which they are made 
and which are purposefully isolated from all but appropriate inputs. 
Their analyzed capacities correspond to purposes humans have for 
them and their analyzing capacities are how humans intend them to 
operate. For these reasons, it is clear how they should be analyzed as 
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having Cummins functions. A life system maintains itself. Moreover 
it often does so in an environment that changes in important re- 
spects both over the individual organism’s lifetime and from genera- 
tion to generation. The question what counts as part of the 
“speciesimaintaining” Cummins system, as opposed to helpful but ac- 
cidental interference from outside, has no determinate answer. 

Systems deacribed by flow charts, circuit diagrams, and programs 
generally fulfill their assigned capacities deterministically, so that, if 
the system does not break down, the capacity is always realized. Many 
important capacities of animals, on the other hand, are stochastic ca- 
pacities. The hunting behaviors of predators typically enable them to 
capture prey only some of the time. The eye-blink reflex protects the 
eye from damage only once in a while, usually being triggered un- 
necessarily. If we set aside whether a capacity has been selected for, 
and also whether it is required to help fulfil another capacity that 
has been selected for, there is no principled way to distinguish 
among such capacities whether to count them within a species- 
maintaining Cummins system or not. Nor will it help to require that 
each member of the continuing species use a stochastic capacity 
some of the time. The basic needs of the organism are often met in 
alternative ways. What serves a certain function often is not actually 
necessary to its accomplishment. A domestic cat may never success- 
fully employ either its ability to see prey in the dark or its ability to 
fish, making a good living nonetheless by daytime mouse hunting, 
perhaps, and by charming people. For animals that learn or reason 
out ways to do things, if one path to “species maintenance” is 
blocked, another will be taken instead. Many humans have survived 
in part because they have learned to hang warm clothes from their 
shoulders, but if they did not have shoulders they would attach the 
clothing some other way instead. The capacities to see prey in the 
dark, to be a shoulder clothes hanger, and so forth are thus not nec- 
essary to species maintenance. Do they count, then, as part of the 
Cummins style system of species maintenance? Do shoulders have 
supporting clothes as one of their species-maintaining Cummins 
functions or not? If they do not, does the pancreas also not have as a 
speciessmaintaining function the production of insulin because in its 
absence people take insulin by injection? 

My conclusion is that there can be no well-defined notion here 
unless without principle, that is, by arbitrarily stipulating some per 
centage of individual lives in some stipulated population that must 
have happened to be furthered by a candidate exaptation. And even 
so, itis not clear how we are to deal with the things which have some- 
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times been useful but were nonetheless entirely dispensable. And be- 
cause there is no such thing as the Cummins functions associated 
with maintenance of a species, there is no such thing as the exapta- 
tions associated with it’ 

The only way to place nonarbitrary limits on what counts as part 
of a biological system is to bound it with adaptations, with proper 
functions. Only Cummins functions that either are adaptations or 
support adaptations are sensibly treated as belonging to biological 
systems. That was the conclusion of my 1993 chapter on exapta- 
tions with its “expanded” notion of proper function which Preston 
decries. She herself ignores all problems concerning delimitation 
of the biological system. She also ignores the restrictions I placed 
on these “expanded” proper functions, namely, that they must be 
functions of reproduced parts of the same organism.’ I now think 
Preston is right, however, that it would be better not to treat these 
expanded functions as varieties of proper function for the other 
reason she gives. The notion of a proper function was introduced 
as a tool for understanding purpose and failure of purpose. If an 
item has been selected for producing some effect, production of 
this effect can be held up as a measure or norm from which depar- 
tures are possible. But my expanded functions were not selected 
for, hence they fail to exhibit this most essential property of proper 
functions. 

Most important, Preston has thoroughly misunderstood my no- 
tions of “adapted proper function” and “derived proper function.” 
These are not add-ons to my original definition of proper function, 
and they are not established “without any history of reproduction” 
(233) or through some kind of equivocation in the notion of selec- 
tion. Let me try to explain. 


*For more careful reflections on this matter, see my “Bio-functions: Two Para 
digms,” m Cummins, A. Ariew, and M Perlman, eds., Functons in Philosophy of Biot- 
ogy and Philosophy of Prychology (New York: Oxford, forthcoming). 

* Preston claims, for example, that my expanded definition of “proper function” 
will give human noses the function of holding up eyeglasses. Her idea is that eye- 
glasses, having continued to be reproduced because they helped people see better, 
have making people see better as a proper function, and a normal condition for 
performance of this function is the presence in the right place of a human nose. 
But this ignores the restriction I gave on those normal conditions which were to 
have proper functions, namely, that they were to result from reproduced structures 
of the same organism. I suppose it might be possible to define a “species 
maintaining” Cummins system for eyeglasses which would require humans in the 
environment for its operation, but this system would not be a human o 
nor would it be this system which was responsible for reproducing noses. She has 
not shown, then, that on my definition, it is any more a proper function of my nose 
to hold up my eyeglasses than it is a function of a dog’s stomach to provide shelter 
for roundworms. For further discusnon, see my “Bio-functions: Two Paradigms.” 
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Consider the perceptual, cognitive, and behavioral systems of an 
organism. These mechanisms, I have argued, have proper func- 
tions.” These functions are performed by altering the relation be- 
tween the organism and the environment as needed so that the 
environment will provide appropriate conditions and inputs for the 
organism. Some of these functions involve changing the environ- 
ment to fit the organism, some involve changing the organism to fit 
the environment, and some involve changing merely spatial relations 
between organism and environment, or involve some combination 
of these three, The characteristic systems that are employed in coor- 
dinating states and happenings between organism and environment 
have relational proper functions of one kind or another. That is, 
their job is to make it the case that the organism and the environ- 
ment bear some particular relation to one another. Their job is to 
create relational structures. 

As with any other proper function, a relational proper function of 
a mechanism corresponds to an effect which ancestors of the mecha- 
nism have historically had which accounted for their selection. In 
this case, the effect was the creation of an abstract relational struc- 
ture. Consider a photocopier. Does it do the same thing every time 
you use it? One time it turns out a manuscript on Gottlob Frege, the 
next time a picture of Marilyn Monroe. But the important thing is 
that it turns out, every time, something that looks like your original. 
In this sense, it does the same thing every time. Its effect is always the 
production of the very same abstract relational structure, namely, same- 
ness between the pattern of surface marks on what is in its scanner 
and what is in its output tray. 

In order to produce this relational structure, to effect that this re- 
lation exists, it does not need to produce both relata. It produces 
only one of the relata, but in such a fashion as to effect the existence 
of the designated relation. Nor is there any cheating here. No device 
can produce an effect, a result, a product, ex nikilo. Every function 
performed is performed using materials of some kind, using proper- 
ties of materials already at hand. Producing a relational structure by 
being guided by one relatum to produce the other is as legitimate a 
form of producing as producing a wooden spoon by choosing a 
piece of wood and carving it. 

The example I originally used of a mechanism with a relational 
proper function was the chameleon’s pigment rearranging mecha- 
nism (LTOBC, chapter two). Its function is to produce the relational 


™ White Queen Psychology, chapter two. 
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structure skin-bearing-the-same-color-as-its-background for the 
chameleon, a further proper function being, of course, to prevent 
predators from seeing the chameleon. To create this relational struc- 
ture, the mechanism effects changes in the chameleon but not in 
the environment. Other animals effect production of similar rela- 
tional structures by moving into parts of the environment which 
match them, that is, by changing the spatial relation between them- 
selves and the environment. And likely there are animals that 
change the environment, say, the surroundings of their nests, in or- 
der to produce this kind of relational structure. 

There are a great number of relational structures besides those in- 
volving sameness relations which mechanisms may have as proper 
functions to produce. I have argued that any mechanism whose job 
is to produce a representation of fact (or to produce an “indicative 
intentional icon”—a more general category than that of representa 
tions of fact) has as a proper function to produce a relational struc- 
ture (LTOBC, chapter six). An accessible example that I keep 
repeating is the mechanisms that produce the dance of the honey 
bee. The job of these mechanisms is to produce a relational struc- 
ture having a location of nectar as one relatum and a certain aspect 
of the pattern of the dance as the other. The relation in question is 
the one given by the abstract function (mathematical sense of “func- 
tion”) describing the semantic rules for the B-mese used by the par- 
ticular species of bees. In most species, an angle that the dance 
pattern movement marks out relative to aspects of the hive always 
bears the same definite relation to the angle of the location of nectar 
relative to hive and sun. The properly functioning dance mechanism 
always produces exactly the same thing, namely, this designated relæ 
tional structure. 

I have also argued that any mechanism whose job is to produce a 
representation of a goal (or to produce an “imperative intentional 
icon”—a more general category than that of representations of 
goals) has as a proper function to produce a relational structure 
(LTOBC, chapter six). An accessible example, once again, is the bee 
dance, which is both indicative and imperative.’’ The mechanisms 
that produce bee dances have as a further proper function, beyond 
producing the dance, to send watching fellow worker bees off in a 
certain direction. The relational structure thus produced has as re- 


"Tt 18 a “pushmi-pullyu representation.” See my “Pushmipultyu Representa- 
tions,” in James Tomberlin, ed., Philosophical Perspectives, Volume IX (Atascadero 
CA: Ridgeview Publishing, 1996), pp 185-200. Reprinted in Mind and Morals, L. 
May and M. Friedman, eds. (Cambndge: MIT, 1996), pp. 145-61. 
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lata (1) the angle that the dance pattern marks out relative to as- 
pects of the hive and (2) the angle in which the watching bees fly ret 
ative to the hive and the sun. In this case, the mechanisms create this 
relational structure by making changes not in the organism itself but 
in its environment—changes not in the bee itself but in the bee’s fel- 
low workers. As a result of producing, first, the proper dance/nectar- 
location structure, the properly functioning dancing-making 
mechanism later effects always exactly the same thing, namely, exis- 
tence of this second relational structure between angle of dance and 
angle of flying workers. (This requires the right environment, of 
course, one in which there are wellfunctioning fellow workers avail- 
able. Their presence is a historically “normal condition” for manag- 
ing to perform this function.) 

Where the proper function of a trait is to produce a series of ef- 
fects each causing the next it is also the proper function of each 
stage, on the definition of proper function that I gave, to produce 
the next stage. So what we have here is the production of one rela 
tional structure—dance/nectarlocation—having as a proper func- 
tion the production of another—dance/direction of flying. Finally, 
and as a logical result (given Euclidean geometry), one more rela- 
tional structure is produced, namely, workers flying toward nectar. 
This is typical. Proper-functional relational structures typically do 
their jobs by producing further relational structures that eventually 
produce a relation between organism and environment yielding con- 
ditions or inputs that the organismic system needs. Sometimes, the 
whole process involves changing only the environment, and some- 
times, only the organism; but sophisticated relational proper func- 
tions typically involve both. They also typically cooperate with other 
structures having other relational proper functions, such as the bee 
dance producers’ cooperation with answering mechanisms in fellow 
worker bees, which mechanisms could be given a similar relational 
analysis. 

The general picture, then, is of proper-functional processes that 
involve series of interweaving stages each of which is an abstract rela- 
tional structure, some moments in these processes producing 
changes in the organism, others producing changes in the world, 
but always involving the exactly the same relations, though among 
different relata each time they are run. They are reproduced invari- 
ant processes, always the same when described in the way that ex- 
plains how they work, that is, what their relevant Cummins functions 
are, yet different in their elements each time. It was to simplify the 
description of these complex relational structures and processes that 
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I introduced in LTOBC the terminology ‘adapted proper function’ 
and ‘derived proper function’. I intended this merely as useful 
nomenclature. It is not an addition or a set of extra clauses widening 
or narrowing the original definition of ‘proper function’, but merely 
a way of talking more easily about phenomena that had already been 
captured by that notion, given that traits and mechanisms can have 
relational proper functions. 

When a mechanism has a relational proper function, it may pro- 
duce one of the relevant relata while the other relatum is not af 
fected but either (1) remains the same or (2) undergoes its own 
independent course of development A simple example of (1) is the 
chameleon’s pigment-arranging mechanism which changes the 
chameleon’s color while the color of the background remains the 
same. A simple example of (2) I take from Brian C. Smith.’* Suppose 
that a species of sunflower not only tracks the sun, but continues to 
move when the sun goes behind a tree so as to catch up with it on 
the other side. Here, the mechanism produces changes in the organ- 
ism designed to maintain a certain organism-environment relation, 
and does so as the environmental relatum is changed, but not, of 
course, by the flower. Similarly, the relational structure that consists 
in the retinal image of a male hoverfly mapping the position and am- 
gle of approach of a passing female has as an eventual proper func- 
tion production of another relational structure consisting in the 
male’s intercepting the female.’ As with the sun and the sunflower, 
the female is (as yet) unaffected by the male. Only a change in the 
direction of flight of the male is effected. 

Now examine the relatum that is actually produced by a mechanism 
with a relational proper function. Consider the brown skin produced 
by the pigment arrangers of the chameleon sitting on a brown back- 
ground. The job of the pigment arrangers is to produce the rela- 
tional structure, skin-color-maiching-its-background. But neither 
relatum, brown background or brown skin, is an operative part of an 
historically normal set of sufficient conditions explaining the capac- 
ity of the chameleon to become camouflaged. Either relatum might 
have been replaced, and so long as the other was similarly replaced, 
the chameleon would have been properly camouflaged. Being 
brown is not an operative part of a normal set of sufficient cond? 
tions for performance of any of the chameleon’s functions. Neither 
the relatum produced nor the relatum to be matched has a proper 
function nor, notice, any kind of Cummins function either. Not all 


2 The Ongin of Objects (Cambridge: MIT, 1996). 
3 For details, see my Whuts Quesn Psychology, chapter 11. 
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by itself! Only the whole relational structure has either kind of func- 
tion. 

Given that brown is the color of the background, however, the job of 
the pigment rearrangers is certainly to make the skin brown. I call 
this kind of job an adapted proper function of the mechanism. Turning 
the skin brown is not usually a proper function of the mechanism, 
and it will not remain a proper function of the mechanism when the 
chameleon no longer sits on something brown. Right now, however, 
given that it is on a brown background, it is an adapted proper func- 
tion of the mechanism to make the chameleon brown. Turning the 
skin brown is a proper function of the mechanism “as adapted to” 
the brown color underneath. It is not, of course, a simple, but only a 
conditional proper function of the mechanism, an “iffy” proper 
function, but the ‘if’ part has been asserted. 

The product produced by a device performing an adapted proper 
function I called an adapted device. The chameleon’s brown color is 
an adapted device. It is not, of course, brown itself that is an adapted 
device, but only the brown skin color of a chameleon produced in 
the nght way in the right circumstances. Thus, Preston is certainly 
right that we do not want to move, for example, from the premise 
that a certain English word token has a function derived from its par- 
ticular speaker’s intention to the conclusion that the English lan- 
guage has changed (237). But the idea that derived proper functions 
are, as she rather misleadingly puts it, “established for particular ex- 
emplars” clearly is not “inconsistent” with the notion that all proper 
functions “essentially involve a selection history” (234). 

Now ask about the functions of the relata themselves in a proper- 
functional relational structure. The proper function of a whole re- 
lational structure, as was said, is often to produce another 
relational structure. The bee-dance mechanism produces the rela- 
tional structure, dance-mapping-thetlocation-of nectar, a proper func- 
tion of which is eventually to produce another relational structure, 
worker-bees-heading-toward-nectar. The nectar, of course, is and re- 
mains an independent relatum (nor, of course, is it reproduced by 
the biological system), so taken by itself, it cannot have a proper 
function derived from that mechanism. But the other relatum, the 
bee dance, is produced by the mechanism in accordance with an 
adapted proper function: it is an adapted proper function of the 
mechanism, as adapted to the location of the nectar, to produce a 
certain bee dance, one that maps this location. Further, it is an 
adapted proper function of the dance-producing mechanism to 
produce, as a result, a certain direction of flight in fellow worker 
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bees. Is it also a proper function of the dance itself to produce this 
direction of flight? 

The answer may at first seem to be “no,” for it seems theoretically 
possible, at least, that the particular bee dance has no ancestors. Per- 
haps no bee in the bee’s lineage ever danced this particular dance 
before, because there never happened to be nectar located in this 
particular direction from their hives before. Then this particular bee 
dance, having never occurred in the past, certainly could not have 
been selected for any effects that it had, hence could not possibly 
have any proper functions. 

But this overlooks a principle that is fundamental. Because bee 
dances that map different directions are different from one another 
does not mean they are not also the same. The original and the pho- 
tocopy may be different colors because colored paper was used for 
one but not the other, but the manuscript about Frege, or the pic- 
ture of Marilyn, is the same. What is of interest is whether there is a 
sameness among the dances such that they are all able to do some 
thing that is the same, and whether that very something is what their 
ancestors were selected for doing. 

The various dances of a given species of bees are very much the 
same—indeed, to an untutored observer, hardly discriminable. And 
when they function in the way that has accounted for the natural se- 
lection of their producers and of their answering mechanisms in 
other workers, they always do exactly the same thing. They produce 
a direction of flight that is a given function (mathematical sense) of 
certain aspects of their form—in every case exactly the same func- 
tion of that form. In LTOBC, I put this rather awkwardly by saying 
that they have (“direct”) proper functions which are adapted to their 
own concrete forms (LTOBC, chapter two and elsewhere). I am not 
sure that I have put it less awkwardly here. But the phenomenon it- 
self is really quite easy to grasp. The bee dance has been selected in 
part for its capacity to cause other worker bees to be guided in their 
direction of flight by its form. In this respect, all bee dances (of the 
same species) have exactly the same proper function. Strictly speak- 
ing, this function is a relational proper function. The dance’s job is 
to cause the workers to fly in a direction that bears a certain relation 
to itself (for example, LTOBC, p. 42). 

Because of this relational proper function, depending on the par- 
ticular form of the bee dance, it has as an adapted proper function 
to cause worker bees to fly in some particular direction—say, south- 
south-west—just as the relational proper function of making the 
chameleon’s skin match its background results in an adapted proper 
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function to turn the skin brown when the chameleon is sitting on 
something brown. I have called this kind of function a derived proper 
function, derived originally from the proper function of the dance- 
producing mechanism. It is derived from the particular form of the 
bee dance which is itself an adapted device, derived from the 
adapted function of the dance-producing mechanism as operating in 
a certain context. All derived proper functions are adapted in the 
same sort of manner. 
In LTOBG, I generalized this idea: 


The proper functions of adapted devices are derived from proper func- 
tions of the devices that produce them that lie beyond the production of 
these adapted devices themselves. I call the proper functions of adapted 
devices “derived proper functions” (LTOBG, p. 41). 


I was not careful to distinguish in this particular passage between 
proper functions of adapted devices that are relational but not de- 
rived from the producer’s particular adaptor (for example, the func- 
tion common to all bee dances) and those that are not relational 
and are derived from the producer’s particular adaptor.’* I had in 
mind the latter kind. Indeed, an adapted device described qua adapted 
is different from other adapted devices derived from the same pro- 
ducer but in different adapting circumstances. And I emphasized 
that the interesting thing about derived proper functions is that 
things that have them can be things that are “quite new under the 
sun.” That is, I meant things with adapted proper functions as being 
named or described in accordance with their adapted aspects (they 
point in different directions, they are different colors) not the as- 
pects that make them like their ancestors (they are all bee dances, 
they are all pigmented skins of chameleons). Things with derived 
proper functions, then, are individual things, not lineages or types of 
things. It is the brown color of this chameleon that has a derived 
proper function, not brown itself, indeed, not even brown 
chameleon skin color, for maybe someone painted a chameleon 
brown. 

Against this background, I argued in LTOBC that there could be 
relational proper functions that produced adapted devices them- 
selves having relational proper functions, producing more adapted 
devices having further relational proper functions, to any degree of 
nesting, and that out of this sort of structure could come things (to- 
kens) seemingly very new indeed under the sun but which still had 
derived proper functions. I argued that in this sort of way, human 


” I distinguished these in the pages immediately following, however. 
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desires have derived proper functions derived from the mechanisms 
designed to form concepts (adapted to the individual’s 
experience!) and then to form desires (adapted to other aspects of 
the individual’s experience), and that the derived proper function of 
a desire is to get itself fulfilled (a relational function). (Reminder: 
that a thing has a proper function does not imply that it is likely to 
perform that function!) 

Further, if the desire is to produce a certain result by means of 
making, for example, a certain tool or producing a certain sentence, 
then a derived proper function of the tool (token) or the sentence 
(token) is to produce this result. Thus, it happens that artifacts have 
as derived proper functions the functions intended for them by their 
makers (contrast with Preston: not their users). I also argued that 
learned behaviors have the results that effected their having been 
learned as proper functions, direct proper functions when the learn- 
ing is by trial and error, otherwise derived. 

Notice, by contrast, that Preston’s analysis of artifact function has 
the peculiar result that artifacts have no functions except in the 
sense that they may function as, say, a hammer or a screwdriver in 
someone’s hands, and no function unless they have actually been re- 
produced on account of serving this function. So, if I carefully de- 
sign and make a new sort of implement for the purpose of opening 
cans, but nobody ever actually uses it, there is no sense whatever in 
which it is a can opener, or in which its function is to open cans. But 
surely its purpose, at least, is to open cans, and purpose is what 
proper functions were originally supposed to cast light on. My claim 
in LTOBC was that artifacts can acquire proper functions either in 
Preston’s way, that is, “directly,” or through the maker’s (not the 
user's) intentions, or in both ways. I also emphasized the possibility 
not only of shifts in proper function but also of conflicts in function 
when items have more than one source of function. 

It should be clear from the above discussion that the way one de- 
scribes the things that have proper functions, and the way one ex- 
plains how a derived function is derived, makes a great deal of 
difference. Sloppy descriptions of these matters will get one into a 
lot of trouble—as they have Preston. She herself claims that in talk- 


* For details, see my “A Common Structure for Concepts of Individuals, Stuffs, 
and Basic Kinds: More Mama, More Milk, and More Mouse,” Behavioral and Bramm 
Samas, XXI, 1 (February 1998): 55-65; also in E. Margolis and S. Laurence eds, 

Core Readsengs (Cambridge: MIT, 1999) Also see my “Author's Response: 
With Enemies Like Thus I Don’t Need Frends,” Behavioral and Brain Sctences, XXi, 1 
(February 1998): 89-100, and my On Confused Ideas (New York: Cambridge, forth- 
coming). 
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ing about these matters, the descriptions under which one considers 
various functional items do not matter because “there is a whole 
range of levels of abstraction on which such explanations may move, 
and which level is appropriate depends on our specific explanatory 
interests” (235). This is a serious misunderstanding. What counts as 
a correct description of a proper-functional trait and what counts as 
the historically normal explanation for its proper functioning, hence 
as the historically normal conditions for its proper functioning, are, 
ontologically, completely determinate matters, regardless of how dif 
ficult the epistemological issues become. These descriptions and ex- 
planations concern the actual historical selection pressures and 
historical causal mechanisms that produced the effects selected for 
in successful ancestors. The description of these causal mechanisms 
has to be one that applies univocally to all of the ancestors. There 
cannot be separate descriptions for separate ancestors, or there is no 
proper function but merely an accidental change in the gene pool. 
Indeed, it also matters very much how traits and mechanisms are de- 
scribed in order to ascribe Cummins functions to them—something 
that Preston overlooks as well.’ 

RUTH GARRETT MILLIKAN 
University of Connecticut 


" For details, see “Bio-functions: Two Paradigms.” 
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Realistic Rationalism. JERROLD J. KATZ. Cambridge: Massachusetts Inst- 
tute of Technology Press, 1998. xxxiv + 226 p. Cloth $35.00.* 


Although billed as a book in the philosophy of mathematics, Jerrold 
Katz’s Realistic Rahonalism is the final chapter in his larger philo- 
sophical project of rebutting contemporary naturalism and regain- 
ing for metaphysics a central place in philosophy. Relying upon his 
earlier The Metaphysics of Meaning’ and Language and Other Abstract 
Objects* for some arguments against W. V. Quine’s and Ludwig 
Wittgenstein’s cases for naturalism, Katz takes positive steps in this 
work toward formulating and justifying a new realist ontology and 
rationalist epistemology. He initiates this task as an investigation in 
the philosophy of mathematics, where the tension between realism 
and naturalism comes into sharp focus, but he soon broadens his 
discussion to defend the existence of synthetic a priori knowledge 
in mathematics, science, and metaphysics and to limit the scope of 
skepticism. Its wide range and novel ideas make this a rich and stim- 
ulating book. 

Katz defines realism as the view that abstract entities exist, and 
sees it as facing three major objections: (1) by their very nature ab- 
stract entities are unknowable; (2) determinate reference to them is 
impossible; (3) the abstract and concrete distinction is untenable. 
He answers the first two objections by arguing vigorously against the 
naturalist presuppositions to which its recent advocates have ap- 
pealed, and by showing how rationalism can provide realism with a 
respectable companion epistemology. In what I found to be the most 
interesting part of the book, he responds to the third objection by 
introducing a new category of objects that bridge the gap between 
the abstract and concrete. These composite objects—the equator 
and linguistic tokens are examples—contain both abstract and con- 
crete components. Unfortunately, I shall have the space here only to 
discuss Katz’s responses to (1) and (2). 

Katz bases part of his argument for realism and rationalism upon 
the inadequacies he finds in nominalism and empiricism. For exam- 
ple, he claims that Hartry Field's nominalism breaks down when 


* I am grateful to Jerrold Katz and Benjamm The for comments on an earlier 


* Totowa, NJ: Rowman and Littlefield, 1981. 


0022-362X/99/ 9604/207-11 © 1999 The Journal of Philosophy, Inc. 


208 THE JOURNAL OF PHILOSOPHY 


one tries to extend it to linguistics, since it confronts the “problems 
with inscriptional nominalism, which doomed the Goodman and 
Quine enterprise” (12). On the other hand, Quine’s empiricism 
fails, he argues, because it cannot account for the certainty of math- 
ematics and logic. The upshot of this criticism is that we must recog- 
nize that mathematics, linguistics, and logic (collectively the formal 
sciences) require an ontology of abstract objects and this, in turn, 
requires a rationalist epistemology. This epistemology should be 
based on reason alone. It must shun both Platonic perceptual ac- 
quaintance with abstract objects and Kantian sensuous intuition so 
that it is entirely independent of experience, whether sensory or in- 
trospective. 

An epistemology of this type will skirt Paul Benacerraf’s famous 
point that it is hard to see how a satisfactory epistemology could ac- 
company a realist account of mathematical truth, since this presup- 
poses that the only satisfactory epistemologies are empiricist. 
Moreover, we are not bound to such an epistemology for the formal 
sciences, according to Katz, for no one has established empiricism as 
the best overall theory of knowledge, and it cannot account for cer- 
tainty in the formal sciences in any case. 

Reason need not generate our beliefs about abstract entities, but, 
in Katz's view, it alone must be used to justify them. For what counts 
as adequate grounds for a proposition depends upon its subject mat- 
ter; and if it is about abstract entities, then its justification must be 
nonempirical. Abstract objects have their intrinsic properties and re- 
lations necessarily, and we can never know these through experience, 
since it can never tell us what must be. (This gives realism the re- 
sources to explain the necessity of truths in the formal sciences. They 
are necessary because they are about abstract objects and describe the 
unchangeable properties and relations of unchangeable objects.) 

How then do we justify knowledge about abstract entities? Katz an- 
swers that we do it through proof, rational intuition, and systematiza- 
tion. A proof grounds a proposition about abstract objects by 
demonstrating the impossibility of the objects’ being other than as 
the proposition claims. But, of course, this works only if the premises 
of the proof are based upon reason alone. Some of these, such as the 
basic “ratiocinative knowledge” that four is composite, are based 
upon rational intuition, while others, such as the principle of mathe- 
matical induction, result from systematization. The latter is a process 
through which we build up systematic connections between clear 
and correct insights and intuitions, and use the resulting body of 
theory to correct unclear intuitions. 
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I shall return to Katz’s epistemology for abstract entities, but let us 
now pass on to reasons for thinking that reference to them is deter- 
minate. According to Katz, all the arguments to the contrary begin 
by excluding certain methods for fixing the reference of a word and 
then describe a situation in which the admissible factors determin 
ing the reference of certain terms are symmetric with respect to a 
certain class of objects. Thus, if the only constraint on whether the 
members of certain sequence are the references of the numerals ‘0’, 
T, ‘2’, and so on is that the sequence be a model of the Peano ax- 
ioms, then, of course, it will remain indeterminate whether the nu- 
merals refer to the Zermelo numbers rather than the von Neumann 
numbers. But, according to Katz, we can break this symmetry: the 
numerals do not refer to sets, for numbers are not sets; “sets but not 
numbers have members, have subsets, contain the null set, and so 
on, while numbers but not sets can be prime or composite, even or 
odd, perfect or imperfect, and so on” (104). Katz does not tell us 
how we know this but I presume that we know it through rational in- 
tuitions. For we cannot prove that numbers are not sets, and I do not 
see how we could arrive at this result through systematization with- 
out begging questions. 

Benacerraf’s famous argument, which Katz is countering here, as- 
sumes that the mathematician’s interest in the numbers stops at the 
level of structure and that it is a mistake for philosophers to try to go 
deeper. But Katz replies that Benacerraf not only has given no argu 
ment for these assumptions, but also falsely assumes that the essential 
properties of numbers are structural. In fact, some of their proper- 
ties—for example, their abstractness—are nonstructural. Thus, Be- - 
nacerraf has illegitimately excluded common-sense and philosophical 
considerations that would fix the reference of number words. This 
lapse is due, according to Katz, to Benacerraf’s acceptance of the 
Quine-Wittgenstein view that philosophy has no subject matter of its 
own and the Fregean view of sense as the determiner of reference. 

If we shed these doctrines, and also accept Katz’s theory of mean- 
ing (developed in The Metaphysics of Meaning), then we will admit an 
important role for synthetic a priori knowledge. For since our con- 
ception of the analytic will be much narrower than Gottlob Frege’s 
and more like John Locke’s, much mathematics, science, and philos- 
ophy will count as synthetic and a priori. Proof, rational intuition, 
and its systematization will provide the ingredients for an epistemol 
ogy for these truths. 

Clearly, the most controversial part of this epistemology is its ap- 
peal to intuition, so I am going to let Katz speak for himself: 
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[An intuition is] an immediate, Le., noninferential purely rational ap- 
prehension of the structure of an abstract object, that is, an apprehen- 
sion that involves absolutely no connection to anything concrete... 
What is present to our minds in a clear and distinct intuition of abstract 
objects is the fact that their structure puts the suppositions of their be- 
ing otherwise than as we grasp them to be beyond the limits of possbil 
ity.... The rationale for claiming that it is intuition that is the source of 
basic mathematical and other formal knowledge is...There are cases in 
which we can eliminate everything but intuition as a posible explana- 
tion of how it is known that a premise or step in a proof has no cour 
terexample (44-45). 


If you find intuition mysterious, as I do, Katz cautions us to distin- 
guish the mysterious from the mystical, and reminds us that philoso- 
phy is full of mysteries. Indeed, he is confident that there is “no 
reason whatsoever to think that scientific psychology cannot in prin- 
ciple explain the rational faculties on which our account of mathe- 
matical knowledge is based, though, to be sure, current scientific 
psychology is far from telling us much about them” (116). 

I am less sanguine, but I do not have space here to give my rea 
sons. In any case, before I could discuss this issue with confidence, I 
would want a fuller and clearer account of Katz’s theory of rational 
intuition than I found in his book. One problem I have traces to his 
distinction between basic and transcendent knowledge of abstract 
objects, with the former being obtained through intuition and the 
latter through systematization. Suppose that we try to apply this idea 
to the law of trichotomy, which figures as a premise in an example 
proof Katz presents (40). This is a general law, thus it appears tran- 
scendent, and Katz seems to take it to be one whose necessity is 
known to us. It also seems to be exactly the sort of nonlogical princi- 
ple that forges connections between pieces of basic knowledge. How 
is it known? By systematization, as Katz suggests general principles 
are known? But then how does systematization get off the ground? 
Maybe we can just hypothesize some initial generalities and plan to 
revise them if we find counterexamples or they lead to unsatisfactory 
systems. (This is consistent with Katz’s view of the a priori; see sec- 
tion 2.6.3.) On the other hand, perhaps we know tricotomy by intu- 
ition, since Katz writes concerning another principle, “even 
mathematically naive people immediately see that, if m things are 
put into n pigeorholes, then when m is greater than n, some hole 
must contain more than one thing.... The only remaining explana- 
tion for the immediate knowledge of the principle is intuition” (45). 
Let us suppose, then, that intuition can supply us with enough gen 
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eral laws to get systematizing going. Katz characterizes intuition (in 
the quote above) as an “apprehension of the structure of an abstract 
object.” This would account for our ability to intuit the number four 
and see that four is composed of two and two. For the pigeon-hole 
principle we would need more. Katz has written me that in this case 
our knowledge is based upon “a concatenation of intuitions.” I 
would like to know more about this, and why it will not supply us 
knowledge of mathematical induction, which, as we have noted, re- 
sults from systematizing. 

MICHAEL D. RESNIK. 
University of North Carolina/ Chapel Hill 
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RESPONSIBILITY, LUCK, AND CHANCE: REFLECTIONS Qa <" 
FREE WILL AND INDETERMINISM* 


| udwig Wittgenstein’ once said that “to solve the problems of 
philosophers, you have to think even more crazily than they 
do” (tbid., p. 75). This task (which became even more difficult 
after Wittgenstein than it was before him) is certainly required for 
the venerable problem of free will and determinism. 
L THE LUCK PRINCIPLE 


Consider the following principle: 


(LP) If an action is undetermined at a time 4 then its happening rather 
than not happening at t would be a matter of chance or hick, and so 
it could not be a free and responsible action. 


This principle (which we may call the luck pnnaple, or simply LP) is 
false, as I shall explain shortly. Yet it seems true. LP and a related 
principle to be considered later in this paper are fueled by many of 
those “intuition pumps,” in Daniel Dennett's’? apt expression, which 


* This paper was prompted by a recent objection made in various forms against 
my view and other incompatbilist views of freedom and responsibility by Galen 
Strawson, Alfred Mele, Bernard Berofsky, Bruce Waller, Richard Double, Mark 
Bernstein, and Ishtryaque Haji. (See footnote 10 for references.) The paper has 
benefitted from interchanges with the above persons and with parncipant at a 
conference on my work on free will at the Univeraty of Arkansas in September, 
1997: Gary Watson, Barry Loewer, Timothy O’Connor, Randolph Clarke, Christo- 
pher Hill, and Thomas Senor. It has also benefitted from interchanges ın confer- 
ences or in correspondence with John Martn Fischer, William Rowe, Nicholas 
Nathan, David Hodgson, Saul Smilansky, Kevin Magill, Peter van Inwagen, Derk 
Pereboom, Laura Ekstrom, Hugh McCann, and Ilya Prigogine. I am especially 
grateful to Mele and Strawson for pursuing me assiduously on these issues since 
the publication of my latest work, and for perceptive comments on the penulti- 
mate draft by Mele, Berofsky, and George Graham. 

1 Cultures and Valus (New York: Blackwell, 1980) 

* Elbow Room (Cambridge: MIT, 1984), chapter 1 and pp. 32-84, 64-65, 119-20, 
169-70 
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support common intuitions about freedom and responsibility. LP 
and related principles lie behind the widespread belief that indeter- 
minism, so far from being required for free will and responsibility, 
would actually undermine free will and responsibility. Dennett does 
not dwell on the intuition pumps of this sort, as I shall do in this pa- 
per. As a compatibilist, he is more interested in criticizing intuition 
pumps that lead people to think (mistakenly, on his view) that free- 
dom and responsibility are not compatible with determinism, 
whereas intuition pumps that support LP lead people to think free- 
dom and responsibility are not compatible with indeterminism. Yet in- 
tuition pumps of the latter kind are every bit as pervasive and 
influential in free-will debates as those Dennett dwells upon; and 
they are as much in need of deconstruction, since they play a signifi- 
cant role in leading people to believe that freedom and responsibil 
ity must be compatible with determinism. 

I think the modern route to compatibilism——-which is the reigning 
view among contemporary philosophers—usually goes through prin- 
ciples like LP at some point or other. In my experience, most ordi- 
nary persons start out as natural incompatbilists. They believe there 
is some kind of conflict between freedom and determinism; and the 
idea that freedom and responsibility might be compatible with deter- 
minism looks to them at first like a “quagmire of evasion” (William 
James) or “a wretched subterfuge” (Immanuel Kant). Ordinary per- 
sons have to be talked out of this natural incompatibilism by the 
clever arguments of philosophers—who, in the manner of their 
mentor, Socrates, are only too happy to oblige. To weaken natural 
incompatibilist instincts, philosophers first argue that what we mean 
by freedom in everyday life is the power or ability to do whatever we 
choose or desire to do—in short, an absence of coercion, compul 
sion, Oppression, and other impediments or constraints upon our 
behavior. They then point out that we can be free in these everyday 
senses to do what we choose or desire, even if our choices and de- 
sires are determined by causes that lie in our past. 

But this line of argument does not usually dispose of incompati- 
bilist intuitions by itself. Ordinary persons might grant that many 
everyday freedoms are compatible with determinism and still wonder 
if there is not also some deeper freedom—the freedom to have an 
ultimate say in what we choose or desire to do in the first place—that 
is incompatible with determinism. (I have argued elsewhere’ that 
this deeper freedom is what was traditionally meant by “free will”) 
So the philosophers must add a second step to their case—an argu- 


> The Significance of Free Will (New York: Oxford, 1996), pp. 10-14, 33-87 
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ment to the effect that any allegedly deeper freedom (of the will) 
that is not compatible with determinism is no intelligible freedom at 
all. And with this step, principles like LP come into the picture. For 
any freedom not compatible with determinism would require inde- 
terminism; and what is undetermined, it seems, would happen by 
chance or luck and could not be a free and responsible action. This 
kind of argument is the one that usually puts the final nail in the cof 
fin of incompatibilist instincts. 

When philosophy professors go through this two-stage argument 
in the modern classroom, they are replicating the standard case 
against traditional (incompatibilist or libertarian) free will which is 
one of the defining characteristics of modernity. The goal is to com 
sign incompatibilist freedom to the dustbin of history with other be- 
liefs that a modern scientific age is encouraged to outgrow. Students 
and ordinary persons subjected to this argument may have an uneasy 
feeling they are being had by the clever arguments of philosophers. 
But, also seeing no obvious response, except an appeal to mystery, 
many of them become compatibilists. 

O INDETERMINISM, THE BOGEYMAN 

The second stage of this two-stage argument in support of compat- 
bilism will concern me here, the one that goes through LP and re- 
lated principles in the attempt to show that indeterminism would 
not enhance, but in fact would undermine, freedom and responsibil- 
ity. What is at stake here is not merely the clever arguments of 
philosophers; for it happens that the case for principles like LP is a 
powerful one. It is difficult to see how indeterminism and chance 
can be reconciled with freedom and responsibility. Philosophers 
have tried to bring this out in a number of ways which will be ad- 
dressed here. We may think of these as the varied intuition pumps 
that support LP and principles like it. 

(1) We are often asked to consider, for example, that whatever is 
undetermined or happens by chance is not under the control of any- 
thing, and so is not under the control of the agent But an action 
that is not under the control of the agent could not be a free and re- 
sponsible action. (Here it is evident that the notion of contro! is in- 
volved in the case for LP: indeterminism and chance imply lack of 
control to a degree that implies lack of freedom and responsibility.) 

(2) Another line of argument often heard is this: suppose a choice 
occurred as the result of an undetermined event (say, a quantum 
jump) in one’s brain. Would that be a free choice? Being undeter- 
mined, it would appear to be more of a fluke or accident than a free 
and responsible action. Some twentieth-century scientists and 
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philosophers have suggested that free will might be rescued by sup- 
posing that undetermined quantum events in the brain could be am- 
plified to have large-scale effects on choice or action.* Unfortunately, 
this modern version of the ancient Epicurean “swerve” of the atoms 
seems to be subject to the same criticisms as its ancient counterpart. 
It seems that undetermined events in the brain or body, whether am- 
plified or not, would occur spontaneously and would be more of a 
nuisance—or perhaps a curse, like epilepsy—than an enhancement 
of freedom and responsibility. 

(3) Nor would it help to suppose that the indeterminism or 
chance came between our choices (or intentions) and our actions. 
Imagine that you are intending to make a delicate cut in a fine piece 
of cloth, but because of an undetermined twitching in your arm, you 
make the wrong cut. Here, indeterminism is no enhancement of 
your freedom, but a hindrance or obstacle to your carrying out your 
purposes as intended. Critics of libertarian freedom’ have often con- 
tended that this is what indeterminism would always be—a hin- 
drance or impediment to one’s freedom. It would get in the way, 
diminishing control, and hence responsibility, instead of enhancing 
them. 

(4) Even more absurd consequences follow if we suppose that in- 
determinism or chance is involved in the initiation of overt actions. 
Arthur Schopenhauer* imagined the case of a man who suddenly 
found his legs start to move by chance, carrying him across the room 
against his wishes. Such caricatures are popular among critics of in- 
determinist freedom for obvious reasons: undetermined or chance- 
initiated overt actions would represent the opposite of controlled 
and responsible actions. 

(5) Going a little deeper, one may also note that, if a choice or ac- 
tion is undetermined, it might occur otherwise given exactly the same 
past and laws of nature up to the moment when it does occurs. This 
means that, if Jane is deliberating about whether to vacation in 
Hawaii or Colorado, and gradually comes to favor and choose 
Hawaii, she might have chosen otherwise (chosen Colorado), given 
exactly the same deliberation up to the moment of choice that in fact led 


1 For example, physicist A. H. Compton, The Fredom of Man (New Haven: Yale, 
1935) and neurophysiologist John Eccles, Facing Reahty (New York: Springer, 
1970). 

* See, for example, Galen Strawson, who argues that, even if free will should be 
incompatible with determinism, indeterminism would be “no help” in enhanang 
either freedom or responmbuility—*The Unhelpfulnem of Indeterminism,” Philoso- 

and Phenomenological Research (forth 
$ Essay on the Preadom of the Will (Indianapolis: Bobbe-Merrill, 1960), p. 47. 
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her to favor and choose Hawaii (exactly the same thoughts, reason- 
ings, beliefs, desires, dispositions, and other charactenstics—not a 
sliver of difference). It is difficult to make sense of this. The choice 
of Colorado in such circumstances would seem irrational and inex- 
plicable, capricious and arbitrary.’ If it came about by virtue of unde- 
termined events in Jane’s brain, this would not be an occasion for 
rejoicing in her freedom, but for consulting a neurologist about the 
waywardness of her neural processes. 

(6) At this point, some defenders of incompatibilist freedom ap- 
peal to Gottfried Leibniz’s*® celebrated dictum that prior reasons or 
motives need not determine choice or action, they may merely “in 
cline without necessitating”—that is, they may incline the agent to- 
ward one option without determining the choice of that option. This 
may indeed happen. But it will not solve the present problem; for it 
is precisely because Jane’s prior reasons and motives (beliefs, desires, 
and the like) incline her toward the choice of Hawaii that choosing 
Colorado by chance at the end of exactly the same deliberation 
would be irrational and inexplicable. Similarly, if her reasons had in- 
clined her toward Colorado, then choosing Hawaii by chance at the 
end of the same deliberation would have been irrational and inex- 
plicable. And if prior reasons or motives had not inclined her either 
way (the celebrated medieval “liberty of indifference”) and the 
choice was a matter of chance, then the choosing of one rather than 
the other would have been all the more a matter of luck and out of 
her control. (One can see why libertarian freedom has often been 
ridiculed as a mere “liberty of indifference.”) 

(7) Indeed, critics of indeterminist freedom have often argued 
that indeterminist free choices must always amount to random 
choices of this sort and hence the outcomes would be matters of 
mere luck or chance—lke spinning a wheel to select among a set of 
alternatives. Perhaps there is a role for such random choices in our 
lives when we are genuinely indifferent to outcomes.’ But to suppose 
that all of our free and responsible choices—including momentous 
ones, like whether to act heroically or treacherously—had to be by 
random selection in this way has been regarded by many philoso- 
phers as a reductio ad absurdum of the view that free will and responsi 
bility require indeterminism. 


* This dilemma for mcompatibilist accounts of freedom is nicely described by 
Thomas Nagel, The View from Nowhers (New York: Oxford, 1986), chapter 7. 

* Selactions (New York: Scribner's, 1951), p. 435. 

* Stephen M. Cahn makes a persuasive case for there being such a role—“Ran- 
dom Choices,” Philosophy and Phenomenological Research, HOW (1977): 549-61. 
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(8) Consider one final argument which cuts more deeply than the 
others and to which I shall devote considerable attention. This paper 
was in fact prompted by new versions of this argument advanced in 
recent years against my incompatbilist account of free will by Galen 
Strawson, Alfred Mele, Bernard Berofsky, Bruce Waller, Richard 
Double, Mark Bernstein, and Ishtiyaque Haji” —though the argu- 
ment is meant to apply generally to any view requiring that free ac- 
tions be undetermined up to the moment when they occur. 

Suppose two agents had exactly the same pasts (as indeterminism 
requires) up to the point where they were faced with a choice be- 
tween distorting the truth for selfish gain or telling the truth at great 
personal cost. One agent lies and the other tells the truth. As Waller 
puts it, if the pasts of these two agents “are really identical” in every 
way up to the moment of choice, “and the difference in their acts re- 
sults from chance,” would there “be any grounds for distinguishing 
between [them], for saying that one deserves censure for a selfish 
decision and the other deserves praise” (op. cit., p. 151)? Mele poses 
the problem in terms of a single agent in different possible worlds. 
Suppose in the actual world, John fails to resist the temptation to do 
what he thinks he ought to do, arrive at a meeting on time. If he 
could have done otherwise given the same past, then his counter- 
part, John* in a nearby possible world, which is the same as the ac- 
tual world up to the moment of choice, resists the temptation and 
arrives on time. Mele then argues that, “if there is nothing about the 
agents’ powers, capacities, states of mind, moral character and the 
like that explains this difference in outcome,...the difference is just a 
matter of luck” (op. at, pp. 582-83). It would seem that John* got 
lucky in his attempt to overcome temptation, whereas John did not. 
Would it be just to reward the one and punish the other for what ap- 
pears to be ultimately the luck of the draw? 


* Strawson, “The Impoeubility of Moral Responsibility,” PAslosophscal Studses, LXXV 
(1994): 5-24, and “The Unhelpfulness of Indeterminism,” op. at; Mele, Review of 
my The Pree Will, this JOURNAL, xcv, 11 (November 1998): 581-84, and 
“Luck and the Significance of Free Will,” Philosophical Exploratons (forthcoming); 
Berofsky, “Ultimate Responnbility in a Determimistic World,” Phi and Phenom- 

Research (forthcoming); Waller, “Free Will Gone Out of Control,” Behavior- 
ism, XVI (1988): 149-67; Double, The Nowreahty of Pree VAL (New York. Oxford, 1991), 
p. 140; Bernstein, “Kanean Libertartanism,” placid Darl Pe E y (1995): 
151-67; Hafi, “Indeterminism and Frankfurttype Exam ee Philosophical Explo- 
raions (forthcoming). Different, but related, concerns 
agency are aired by oe O’Connor, “Indeterminism and Free Agency: Three 
Recent Views,” Philosophy and Phenomenological Research, Li (1993): 499-596; and 
Randolph Clarke, “Free Choice, Effort and Wanting More,” Philosobhical Explorations 
(forthcoming). 
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Considerations such as (1)-(8) lie behind familiar and varied 
charges that undetermined choices or actions would be “arbitrary,” 
“capricious,” “random,” “uncontrolled,” “irrational,” “inexplicable,” 
or “matters of luck or chance,” and hence not free and responsible 
actions. These are the charges which principles like LP are meant to 
express. Responses to them in the history of philosophy have been 
many; but none to my mind has been entirely convincing. The 
charges have often led libertarians—those who believe in an incom- 
patibilist free will—to posit “extra factors” in the form of unusual 
species of agency or causation (such as noumenal selves, immaterial 
egos, or Nonoccurrent agent causes) to account for what would oth- 
erwise be arbitrary, uncontrolled, inexplicable, or mere luck or 
chance. I do not propose to appeal to any such extra factors in de- 
fense of libertarian freedom. Such appeals introduce additional 
problems of their own without, in my view, directly confronting the 
deep problems about indeterminism, chance, and luck to which con- 
siderations (1)-(8) are pointing. To confront these deep problems 
directly, I believe one has to rethink issues about indeterminism and 
responsibility from the ground up, without relying on appeals to ex- 
tracausal factors—a task to which I now turn. 

M INDETERMINISM AND RESPONSIBILITY 
First, one must question the intuitive connection in people’s 
minds between “indeterminism’s being involved in something’s 
happening” and “its happening merely as a matter of chance or 
luck.” ‘Chance’ and ‘luck’ are terms of ordinary language which 
carry the connotation of “its being out of my control” (as in (1) 
and (4) and above). So using them already begs certain questions, 
whereas ‘indeterminism’ is a technical term that merely precludes 
deterministic causation (though not causation altogether). Second, 
one must emphasize that indeterminism does not have to be in- 
volved in all free and responsible acts, even for incompatbilists or 
libertarians.” Frequently, we act from a will already formed; and it 
may well be that our actions are determined in such cases by our 


©! I defend this point at length in Pree Wall and Values (Albany: SUNY, 1985), 
chapters 4 and 5. It is also defended by Peter van Inwagen, “When Is the Will 
Free?” in J. Tomberlin, ed., Philosophical Perspectroes, Volume 3 (Atascadero, CA: 
Ridgeview, 1989), pp. 399-422. John Martın Fischer has described the view that van 
Inwagen and I defend as “restricted libertarianism,” and has cnticized it in “When 
the Will Is Free,” in Tomberiin, ed., Philosophical Perspecteves, Volume 6 (Atascadero: 
Ridgeview, 1992), pp. 423-51. Another cntic is Hugh McCann, “On When the Will 
Is Free,” in G. Holmstrom-Hintkka and R Tuomela, eds , Contemporary Achon The 
ory, Volume 1 (Dordrecht: Kluwer, 1997), pp. 219-32 van Inwagen responds to Fir 
cher in “When Is the Will Not Free?” Philosophical Studses, xxv (1994): 95-114; and 


I respond in The Signsficance of Pree Will, pp. 32-48 
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then existing characters and motives. On such occasions, to do 
otherwise by chance would be a fluke or accident, irrational and 
inexplicable, as critics of indeterminist freedom contend (in (3) 
and (4) above). 

Incompatibilists about free will should not deny this. What they 
should rather say is that when we act from a will already formed 
(as we frequently do), it is “our own free will” by virtue of the fact 
that we formed it (at least in part) by earlier choices or actions 
which were not determined and for which we could have done 
otherwise voluntarily, not merely as a fluke or accident. I call these 
earlier undetermined actions self-forming actions or SFAs.® Unde- 
termined SFAs are a subset of all of the actions done of our own 
free wills (many of which may be determined by our earlier 
formed character and motives). But if there were no such undeter- 
mined SFAs in our lifetimes, there would have been nothing we 
could have ever voluntarily done to make ourselves different than 
we are—a condition that I think is inconsistent with our having 
the kind of responsibility for being what we are which genuine 
free will requires. 

Now, let us look more closely at these undetermined SFAs. As I 
see it, they occur at times in life when we are torn between compet- 
ing visions of what we should do or become. Perhaps we are torn be- 
tween doing the moral thing or acting from self-interest, or between 
present desires and long-term goals, or we are faced with difficult 
tasks for which we have aversions. In all such cases, we are faced 
with competing motivations and have to make an effort to over- 
come temptation to do something else we also strongly want. In the 
light of this picture, I suggest the following incompatibilist account 
of SFAs." There is a tension and uncertainty in our minds at such 
times of inner conflict which are reflected in appropriate regions of 
our brains by movement away from thermodynamic equilibrium— 
in short, a kind of stirring up of chaos in the brain that makes it sen- 
sitive to micro-indeterminacies at the neuronal level. As a result, the 
uncertainty and inner tension we feel at such soul-searching mo- 
ments of self-formation is reflected in the indeterminacy of our 
neural processes themselves. What is experienced phenomenolog? 
cally as uncertainty corresponds physically to the opening of a win- 


™ See The Significance of Pres Will, pp 74-78 SFAs are also sometimes called “self 
formung willings” or SFWs in that work (pp. 125ff.). 

» This, in broad outline, is the account developed in my The Significance of Fres 
Will, chapters 8-10. In later sections below, I make important additions to it in re- 
sponse to criticisms. 
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dow of opportunity that temporarily screens off complete determi- 
nation by the past. (By contrast, when we act from predominant mo- 
tives or settled dispositions, the uncertainty or indeterminism is 
muted. If it were involved then, it would be a mere nuisance or 
fluke, capricious or arbitrary, as critics contend (in (2), (5) and (6) 
above).) 

When we do decide under such conditions of uncertainty, the 
outcome is not determined because of the preceding indetermi- 
nacy—and yet it can be willed (and hence rational and voluntary) 
either way owing to the fact that in such self-formation, the agents’ 
prior wills are divided by conflicting motives. If we overcome temp- 
tation, it will be the result of our effort; and if we fail, it will be be- 
cause we did not allow our effort to succeed. And this is owing to the 
fact that, while we wanted to overcome temptation, we also wanted 
to fail, for quite different and incommensurable reasons. When we 
decide in such circumstances, and the indeterminate efforts we are 
making become determinate choices, we make one set of competing 
reasons or motives prevail over the others then and there by 
deciding. 

Return now to concerns about indeterminism and responsibility 
in the light of this picture. Consider a businesswoman who faces a 
conflict in her will of the kind typically involved in such SFAs. She 
is on the way to a meeting important to her career when she ob 
serves an assault in an alley. An inner struggle ensues between her 
moral conscience, to stop and call for help, and her career ambit 
tions, which tell her she cannot miss this meeting—a struggle she 
eventually resolves by turning back to help the victim. Now sup- 
pose this woman visits some future neuroscientists the next day 
and they tell her a story about what was going on in her brain at 
the time she chose, not unlike the story just told. Prior to choice, 
there was some indeterminacy in her neural processes stirred up 
by the conflict in her will. The indeterminism made it uncertain 
(and undetermined) whether she would go back to help or press 
onward. 

Suppose further that two recurrent and connected neural net- 
works are involved in the neuroscientists’ story. Such networks circu- 
late impulses and information in feedback loops and generally play a 
role in complex cognitive processing in the brain of the kind that 
one would expect to be involved in human deliberation. Moreover, 
recurrent networks are nonlinear, thus allowing (as some recent re- 
search suggests) for the possibility of chaotic activity, which would 
contribute to the plasticity and flexibility human brains display in 
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creative problem solving (of which practical deliberation is an exam- 
ple). The input of one of these recurrent networks consists of the 
woman’s moral motives, and its output the choice to go back; the in- 
put of the other, her career ambitions, and its output, the choice to 
go on to her meeting. The two networks are connected, so that the 
indeterminism that made it uncertain that she would do the moral 
thing was coming from her desire to do the opposite, and vice 
versa——the indeterminism thus arising, as we said, from a conflict in 
the will. When her effort to overcome self-interested desires suc- 
ceeded, this corresponded to one of the neural pathways reaching 
an activation threshold, overcoming the indeterminism generated by 
the other. 

To this picture, one might now pose the following objection: if it 
really was undetermined which choice the woman would make (in 
neural terms, which network would activate) right up to the moment 
when she chose, it seems that it would be a matter of luck or chance 
that one choice was made rather than the other, and so she could 
not be held responsible for the outcome. (Note that this is an ex- 
pression of LP.) The first step in response is to recall a point made 
earlier: we must be wary of moving too hastily from ‘indeterminism 
is involved in something’s happening’ to ‘its happening merely as a 
matter of chance or luck’. ‘Luck’ and ‘chance’ have meanings in or- 
dinary language that mere indeterminism may not have. The second 
step is to note that indeterminism of itself does not necessarily un- 
dermine control and responsibility.” Suppose you are trying to think 
through a difficult problem (say, a mathematical problem) and 
there is some indeterminacy in your neural processes complicating 
the task—a kind of chaotic background. It would be like trying to 
concentrate and solve a problem with background noise or distrac- 
tion. Whether you are going to succeed in solving the mathematical 
problem is uncertain and undetermined because of the distracting 
neural noise. Yet if you concentrate and solve the problem nonethe- 
less, I think we can say that you did it and are responsible for doing 


" See P Huberman and G. Hogg, “Phase Transitions in Artificial Intelligence 
Systems,” Artificial Intelhgence, OT (1987): 155-72; C. Skarda and W. Freeman, 
“How Brains Make Chaos in Order to Make Sense of the World,” Behavior and 
Bran Sciences, x (1987): 161-95; A. ents and A. Destexhe, “Strange Attractors 
in the Human Cortex,” in L. Rensing, ed, Temporal Disorder m Human Osallatory 
Systems (New York: Spnnger, 1985), pp. 182-43. 

rtant recent defenses of the claim that indeterminism does not necessar- 
uy undermine control and responsibility include Clarke, “Indeterminism and Con- 
trol,” American Philosophical Quarter, xxxi (1995): 125-88; Carl Ginet, On Action 
(New York: Cambridge, 1990), chapter 6, O’Connor; and Laura Ekstrom, Pres Wall 
(Boulder: Westview, forthcoming). 
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it even though it was undetermined whether you would succeed. The 
indeterministic noise would have been an obstacle to your solving 
the problem which you nevertheless overcame by your effort. 

There are numerous other examples in the philosophical litera- 
ture of this kind, where indeterminism functions as an obstacle to 
success without precluding responsibility. Consider an assassin who 
is trying to kill the prime minister, but might miss because of some 
undetermined events in his nervous system which might lead to a 
jerking or wavering of this arm. If he does hit his target, can he be 
held responsible? The answer (as J. L. Austin and Philippa Foot’ suc- 
cessfully argued decades ago) is “yes,” because he intentionally and 
voluntarily succeeded in doing what he was trying to do—kill the 
prime minister. Yet his killing the prime minister was undetermined. 
We might even say in a sense that he got lucky in killing the prime 
minister, when he could have failed. But it does not follow, if he suc- 
ceeds, that killing the prime minister was not his action, not some- 
thing he did; nor does it follow, as LP would require, that he was not 
responsible for killing the prime minister. Indeed, if anything is 
clear, it is that he both killed the prime minister and was responsible 
for doing so. 

Or consider a husband who, while arguing with his wife, swings his 
arm down in anger on her favorite glass table top, intending to 
break it. Again we suppose that some indeterminism in the hus- 
band’s efferent neural pathways makes the momentum of his arm in- 
determinate, so it is undetermined if the table will break nght up to 
the moment when it is struck. Whether the husband breaks the table 
or not is undetermined. Yet it does not follow, if he succeeds, that 
breaking the table was not something he did; nor again does it fol 
low, as LP would require, that he was not responsible for breaking 
it.” The inference sanctioned by LP from ‘it was undetermined’ to 
‘he was not responsible’, is not valid. The above cases are counterex- 


amples to it; and there are many more. 
IV. POSSIBLE WORLDS AND LP* 


But one may grant this and still object that counterexamples to LP of 
these kinds do not amount to genuine exercises of free will involving 


* Austin, “Ifs and Cans,” in his Philosophical Papers (New York: Oxford, 1961), 
pp. 153-80; Foot, “Free Will as Involving Determinism,” in Berofsky, ed., Fre Wil 
and Determinism (New York: Harper and Row, 1966), pp. 95-108. 

We must, of course, assume in both these examples that other (compatbilist) 
conditions for responsibility are in place—for example, that, despite his anger, the 
husband was not acting compulsively and would have controlled himself, if he had 
wished, that he knew what he was doing and was doing it intentionally to anger his 
wife, and so on (and similarly for the assassin). But the point is that nothing in the 
facts of either case preclude these assumptions from also being satisfied. 
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SFAs, such as the businesswoman’s, where there is conflict in the 
wills of the agents and they are supposed to choose freely and re- 
sponsibly whichever way they choose. If the assassin and husband suc- 
ceed in doing what they are trying to do (kill the prime minister, 
break the table) they will do it voluntarily (in accordance with their 
wills) and intentionally (knowingly and purposely). But if they fail be- 
cause of the indeterminism, they will not fail voluntarily and inten- 
tionally, but “by mistake” or “accident,” or merely “by chance.” Thus, 
their “power” to do otherwise (if we should even call it a power) is not 
the usual power we associate with freedom of choice or action in self- 
formation, where the agents should be able to choose or act either 
way voluntarily or intentionally. The power to do otherwise of the as- 
sassin and the husband is more like Jane’s “power” in (5) and (6) of 
section I, to choose to vacation in Colorado by a fluke or accident, 
after a long deliberation in which she had come to favor Hawaii. 

As a consequence, while LP may fail for cases like those of the as- 
sassin, husband, and mathematical problem solver, another luck 
principle similar to LP might still be applicable to genuine exercises 
of free will involving SFAs, like the businesswoman’s: if it is undeter- 
mined at t whether an agent voluntarily and intentionally does A at tor 
voluntarily and intentionally does otherwise, then the agent’s doing 
one of these rather than the other at t would be a matter of luck or 
chance, and so could not be a free and responsible action. This prin- 
ciple—let us call it LP*—is fueled by the same intuitions that fuel 
LP. Indeed, it is a special case of LP, but one that is more difficult to 
deal with because it is not subject to counterexamples like those of 
the husband and the assassin; and it seems to be applicable to SFAs, 
like the businesswoman’s, where failure is not merely a matter of 
mistake or accident. 

To explore further the difficulties posed by LP*, let us look at the 
final and, I think, most powerful of the intuition pumps in support 
of LP-type principles mentioned in section 1, namely, consideration 
(8). This was the argument of Strawson, Mele, Berofsky, Waller, Dou- 
ble, Bernstein, and Haji about two agents, or one agent in different 
possible worlds, with the same pasts. 

Consider the version of this argument by Mele, which appeared in 
this JOURNAL and is a particularly revealing and challenging version 
of it. In the actual world, an agent John succumbs to the temptation 
to arrive late to a meeting, whereas his counterpart, John*, in a 
nearby possible world, whose physical and psychological history is 
the same as John’s up to the moment of choice (as indeterminism 
requires), resists this temptation. Similarly, we can imagine a coun- 


f 
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terpart to the businesswoman, businesswoman *, in a nearby possible 
world who goes to her meeting instead of stopping to aid the assault 
victim, given the same past. But then, Mele argues, “if there is noth- 
ing about [these] agents’ powers, capacities, states of mind, moral 
character and the like that explains this difference in outcome,” 
since they are the same up to the moment of choice in the two possi- 
ble worlds, “then the difference is just a matter of luck” (op. at, p. 
583). It would seem that John* got lucky in his attempt to over- 
come temptation, whereas John did not; and similarly, the business- 
woman got lucky in her attempt to overcome temptation, while 
businesswoman* did not. 

Let us first consider a general form of this argument that would 
support LP. 


(a) In the actual world, person P (for example, John, the business- 
woman) does A att 


On the assumption that the act is undetermined at 4 we may imag- 
ine that: 


(b) In a nearby-possible world which is the same as the actual world up 
to 4 P* (P’s counterpart with the same past) does otherwise (does 
Batt 

(c) But then (since their pasts are the same), there is nothing about the 
agents’ powers, capacities, states of mind, characters, dispositions, 
motives, and so on prior to t which explains the difference in 
choices in the two possible worlds. 

(d) It is therefore a matter of luck or chance that P does A and P* does 
Batt 

(e) Pis therefore not responsible (praiseworthy or blameworthy, as the 
case may be) for A at ¢ (and presumably P* is also not responsible 
for B). 


Call this the luck argument. The key assumption is the assumption of 
indeterminism, which leads to step (b). The remaining steps are 
meant to follow from (b), given (a). 

Despite the fact that this argument looks like Mele’s and has an 
initial plausibility, it is not his argument—and it is a good thing it is 
not. For the argument from (a)-(e) is invalid as it stands—for the 
same reasons that LP was invalid. Consider the husband and hus- 


" I have elsewhere denied that the pasts of the agents can be exactly the same, 
since, with indeterminist efforts, there is no exact samenes or difference (The Sig- 
of Free Will, pp. 171-74). Mele’s argument is designed to work, however, 
whether this dental of exact sameness is assumed or not. So I do not make an issue 
of ıt here 


230 THE JOURNAL OF PHILOSOPHY 


band* (his counterpart in a nearby world who fails to break the 
wife’s table). If the outcome is undetermined, husband and hus- 
band* also have “the same powers, capacities, states of mind, charac- 
ters, dispositions, motives, and so on” up to the moment of breaking 
or not breaking the table, as the argument requires; and it is a mat 
ter of luck or chance that the table breaks in one world and not the 
other. But for all that, it does not follow, as (e) requires, that the 
husband is not responsible for breaking the table. The husband 
would have quite a task persuading his wife that he was not responst 
ble for breaking the table on the grounds that it was a matter of luck 
or chance that it broke. (“Luck or chance did it, not me” is an im- 
plausible excuse.) 

But, of course, as we noted, husband* is not also responsible for 
Jaling to break the table, since he does not fail to break it voluntarily 
or intentionally. He is responsible only for the attempt, when he 
fails. Similarly, assassin* would be responsible for the attempted 
murder of the prime minister, when he missed. What has to be ex- 
plicitly added to the argument (a)-(e) to avoid counterexamples like 
these is the LP* requirement that both P and P* voluntarii and inten- 
tionally do A and B respectively in their respective worlds. Specift 
cally, we must add to premise (a) that P voluntarily and intentionally 
does A at tand to (b), that P* voluntarily and intentionally does Bat 
t, and then make the corresponding additions to (d) and (e). This 
will yield what we might call the LP* version of the luck argument 
rather than the LP version. And the stronger LP* version is clearly 
the one Mele intends, since John’s choice in his example is supposed 
to be an SFA, like the businesswoman’s choice in my example, where 
the agents can go either way voluntarily and intentionally. Moreover, 
this version of the argument—like LP* itself—is immune to coun- 
terexamples like those of the husband and the assassin. 

V PARALLEL PROCESSING 

Nonetheless, despite immunity from these counterexamples, I think 
the LP* version of the luck argument, and LP* itself, also fail. But it 
is far less easy to show why. To do so, we have to take a closer look at 
SFAs and push the argument beyond where it has come thus far. Let 
it be granted that the businesswoman’s case and other SFAs like 
John’s are not like the examples of the husband and the assassin. 
The wills of the husband and assassin are already “set” on doing what 
they intend, whereas the wills of agents in SFAs, like the business- 
woman and John, are not already settled or “formed” until they 
choose (hence the designation “self-forming actions”). 


* See The Significance of Free Will, pp. 112-14. 
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Thus, to get from examples like those of the husband and assassin 
to genuine SFAs, I think we must do two things. First, we must put 
the indeterminacy involved in the efferent neural pathways of the 
husband and assassin into the central neural processes of the bust 
nesswoman and other agents, like John, who are making efforts of 
will to overcome moral, prudential, and other temptations. This 
move has already been made in earlier sections. But to respond to 
LP* versions of the luck argument, like Mele’s, I believe this move 
must also be combined with another—a kind of “doubling” of the 
example given earlier of solving the mathematical problem in the 
presence of background indeterministic noise.” 

Imagine that the businesswoman is trying or making an effort to 
solve two cognitive problems at once, or to complete two competing 
(deliberative) tasks at once—to make a moral choice and to make a 
choice for her ambitions (corresponding to the two competing 
neural networks involved in the earlier description). With respect to 
each task, as with the mathematical problem, she is being thwarted 
in her attempt to do what she is trying to do by indeterminism. But 
in her case, the indeterminism does not have a mere external 
source; it is coming from her own will, from her desire to do the op- 
posite. Recall that the two crossing neural networks involved are con- 
nected, so that the indeterminism which is making it uncertain that 
she will do the moral thing is coming from her desire to do the op- 
posite, and vice versa. She may therefore fail to do what she is trying 
to do, just like the assassin, the husband, and the person trying to 
solve the mathematical problem. But I argue that, if she nevertheless 
succeeds, then she can be held responsible because, like them, she 
will have succeeded in doing what she was tryng to do. And the inter- 
esting thing is that this will be true of her, whichever choice is made, be- 
cause she was trying to make both choices and one is going to 
succeed. 

Does it make sense to talk about agents trying to do two compet- 
ing things at once in this way? Well, we know the brain is a parallel 
processor and that capacity, I believe, is essential for the exercise of 
free will. In cases of self-formation, agents are simultaneously trying 
to resolve plural and competing cognitive tasks. They are, as we say, 


= This further “doubling” move 1s consistent with the theory put forward in The 
Significance of Pree Will, and presupposes much of that theory, but is not made in 
that work. It is a further development especially provoked by Mele’s argument dis- 
cussed here as well as by criticisms of other persons since the book’s publication, 
such as Strawson, Berofsky, Nicholas Nathan, Gary Watson, Clarke, O’Connor, 
Double, and Hay 
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of two minds. But they are not therefore two separate persons. They 
are not disassociated from either task.” The businesswoman who 
wants to go back and help the assault victim is the same ambitious 
woman who wants to go on to her meeting and close the sale. She is 
a complex creature, like most of us who are often torn inside; but 
hers is the kind of complexity needed for free will. And when she 
succeeds in doing one of the things she is trying to do, she will en- 
dorse that as Aer resolution of the conflict in her will, voluntarily and 
intentionally, as LP* requires. She will not disassociate from either 
outcome, as did Jane (in (5) of section 1), who wondered what “hap- 
pened to” her when she chose Colorado, or like the husband and as- 
sassin who did not also want to fail.” 

But one may still object that the businesswoman makes one 
choice rather than the other by chance, since it was undetermined 
right up to the last moment which choice she would make. If this is 
so, we may have the picture of her first making an effort to over- 
come temptation (to go on to her meeting) and do the moral 
thing, and then at the last minute “chance takes over” and decides 
the issue for her. But this is the wrong picture. On the view just de- 
scribed, you cannot separate the indeterminism from the effort to 
overcome temptation in such a way that first the effort occurs fob- 
lowed by chance or luck (or vice versa). One must think of the effort 
and the indeterminism as fused; the effort 1s indeterminate and the 
indeterminism is a property of the effort, not something separate 
that occurs after or before the effort. The fact that the woman’s ef- 
fort of will has this property of being indeterminate does not make 
it any less her effort. The complex recurrent neural network that re- 
alizes the effort in the brain is circulating impulses in feedback 
loops and there is some indeterminacy in these circulating im- 
pulses. But the whole process is her effort of will and it persists 
right up to the moment when the choice is made. There is no 
point at which the effort stops and chance “takes over.” She 


™ I account for this elsewhere in terms of the notion of a “selfnetwork” (The Sig- 
nificance of Pree Will, pp 137-42), a more comprehensarve network of neural connec- 
tons representing the general motivational system in terms of which agents define 
themselves as agents and practical reasoners. For further discussion of such a no- 
tion, see Owen Flanagan, Consacusness Reconndered (Cambridge: MIT, 1992), pp. 
207fF. 


claim (The Significance of Fræ Will, p. 215) that “free willers 
Ea o PR E ee a E ee 
lights,” by trying to overcome temptations of various sorts, Strawson asks: but “can’t 
they also try to be worse than they are?”°—" The Unhelpfulness of Indeterminism.” 
He 1s right, of course; they can. I should have added what I am saying here, that 
free willers can and do alse try to be as bad or worse than they are by resisting ef 
forts to be better. Strange creatures indeed. 
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chooses as a result of the effort, even though she might have failed 
because of the indeterminism. 

And just as expressions like “She chose by chance’ can mislead us 
in these contexts, so can expressions like ‘She got lucky’. Ask your- 
self this question: Why does the inference ‘He got lucky, so he was not 
responsible fail when it does fail, as in the cases of the husband and 
the assassin? The first part of an answer goes back to the claim that 
‘luck’, like ‘chance’, has question-begging implications in ordinary 
language which are not necessarily implications of “indeterminism” 
(which implies only the absence of deterministic causation). The 
core meaning of ‘He got lucky’, which is implied by indeterminism, I 
suggest, is that ‘He succeeded despite the probability or chance of failure’; 
and this core meaning does not imply lack of responsibility, if he suc- 
ceeds. 

If “He got lucky’ had further meanings in these contexts often 
associated with ‘luck’ and ‘chance’ in ordinary usage (for exam- 
ple, the outcome was not his doing, or occurred by mers chance, 
or he was not responsible for it), the inference would not fail for 
the husband and assassin, as it clearly does. But the point is that 
these further meanings of ‘luck’ and ‘chance’ do not follow from 
the mere presence of indeterminism. Second, the inference ‘He got 
lucky, so he was not responsible’ fails because what the assassin 
and husband succeeded in doing was what they were trying and 
wanting to do all along. Third, when they succeeded, their reac- 
tion was not “Oh dear, that was a mistake, an accident—some- 
thing that happened to me, not something I did.” Rather, they 
endorsed the outcomes as something they were trying and wanting 
to do all along, that is to say, knowingly and purposefully, not by 
mistake or accident. 

But these conditions are satisfied in the businesswoman’s case as 
well, either way she chooses. If she succeeds in choosing to return to 
help the victim (or in choosing to go on to her meeting) (i) she will 
have “succeeded despite the probability or chance of failure”; (i) 
she will have succeeded in doing what she was trying and wanting to 
do all along (she wanted both outcomes very much, but for different 
reasons, and was trying to make those reasons prevail in both cases); 
and (iii) when she succeeded (in choosing to return to help) her re- 
action was not “Oh dear, that was a mistake, an accident—something 
that happened to me, not something I did.” Rather, she endorsed 
the outcome as something she was trying and wanting to do all 
along; she recognized it as her resolution of the conflict in her will. 
And if she had chosen to go on to her meeting she would have en- 
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dorsed that outcome, recognizing it as her resolution of the conflict 
in her will. 
VI THE LUCK ARGUMENT REVISITED 

With this in mind, let us return to the LP* version of the argument 
from (a)-(¢). I said that Mele clearly intends this stronger LP* ver- 
sion of the argument, since the force of his argument depends on 
the fact that John’s choice in his example is a SFA, like the business- 
woman's, instead of being like the actions of the husband and assas- 
sin. But if this is so, then John's situation will also be like the 
businesswoman’s on the account just given of SFAs. Since both of 
them are simultaneously trying to do both of the things they may do 
(choose to help or go on, overcome the temptation to arrive late or 
not), they will do either with intent or on purpose, as a result of 
wanting and trying to do it-—-that is, intentionally and voluntarily. 
Thus, their “failing” to do one of the options will not be a mistake or 
accident, but a voluntary and intentional doing of the other. 

Likewise, businesswoman* and John* are simultaneously trying to 
do both things in their respective worlds; and they will not “fail” to 
act on moral or weak-willed motives by mistake or accident, as the 
case may be, but by voluntarily and intentionally choosing to act on 
the opposing motives. The point is that in self-formation of these 
kinds {SFAs), failing is never just failing; it is always also a succeeding 
in doing something else we wanted and were trying to do. And we 
found that one can be responsible for succeeding in doing what one 
was trying to do, even in the presence of indeterminism. So even if 
we add the LP* requirement of more-than-one-way voluntariness and 
intentionality to the argument of (a)-(e), the argument remains in- 
valid for cases like the businesswoman’s and other SFAs, like John’s. 

But one might argue further, as Mele does, that John and John* 
(and businesswoman and businesswoman") not only had the same 
capacities, motives, characters, and the like prior to choice, but they 
made exactly the same efforts as well. And this does seem to suggest 
that the success of one and failure of the other was a matter of mere 
luck or chance, so that John and the businesswoman were not re- 
sponsible. But again the inference is too hasty. Note, first, that hus- 
band and husband* also made the same efforts (as well as having the 
same Capacities, motives, and characters) up to the very moment of 
breaking of the table. Yet it does not follow that the husband is not 
responsible when he succeeds. And both the businesswoman and 
businesswoman", and John and John’, are in the position of the hus- 
band in their respective worlds, since both will have succeeded in do- 
ing what they were trying to do. 


RESPONSIBILITY, LUCK, AND CHANCE 235 


But one may still want to object: if the businesswoman and busi- 
nesswoman*, and John and John*, make exactly the same efforts, 
how can it not be a matter of chance that one succeeds and the 
other does not, in a way that makes them not responsible? To 
which I reply: But if they both succeeded in doing what they were 
trying to do (because they were simultaneously trying to do both 
things), and then having succeeded, they both endorsed the out- 
comes of their respective efforts (that is, their choices) as what they 
were trying to do, instead of disowning or disassociating from those 
choices, how then can we not hold them responsible? It just does 
not follow that, because they made exactly the same efforts, they 
chose by chance. 

To say something was done “by chance” usually means (as in the 
assassin and husband cases when they fail), it was done “by mis- 
take” or “accidentally,” “inadvertently,” “involuntarily,” or “as an 
unintended fluke.” But none of these things holds of the business- 
woman and John either way they choose. Unlike husband*, busi- 
nesswoman* and John do not fail to overcome temptation by 
mistake or accident, inadvertently or involuntarily. They con- 
sciously and willingly fail to overcome temptation by consciously 
and willingly choosing to act in selfish or weak-willed ways. So, just 
as it would have been a poor excuse for the husband to say to his 
wife when the table broke that “Luck or chance did it, not me,” it 
would be a poor excuse for businesswoman* and John to say “Luck 
or chance did it, not me” when they failed to help the assault vic- 
tim or failed to arrive on time. 

Worth highlighting in this argument is the point that we cannot 
simply say the businesswoman and businesswoman*® (or John and 
Jobn*) made exactly the same effort (in the singular) in their respec- 
tive possible worlds and one succeeded while the other failed. We 
must say they made exactly the same efforts (plural) in their respec- 
tive worlds. Mentioning only one effort prejudices the case, for it 
suggests that the failure of that effort in one of the worlds was a mere 
mistake or accident, when the fact is that both of the agents (P and 
P*) made both efforts in both worlds. In one world, one of the efforts 
issued in a choice and in the other world, a different effort issued in 
a different choice; but neither was merely accidental or inadvertent 
in either world. I would go even further and say that we may also 
doubt that the efforts they were both making really were exactly the 
same. Where events are indeterminate, as are the efforts they were 
making, there is no such thing as exact sameness or difference of 
events in different possible worlds. Their efforts were not exactly the 
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same, nor were they exactly different, because they were not exact. 
They were simply unique.” 

One might try another line: perhaps we are begging the question 
in assuming that the outcomes of the efforts of the businesswoman 
and her counterpart were choices at all. If they were not choices to be- 
gin with, they could not have been voluntary choices. One might ar- 
gue this on the grounds that (A) “Lf an event is undetermined, it 
must be something that merely happens and cannot be somebody’s 
choice”; and (B) “If an event is undetermined, it must be something 
that merely happens, it cannot be something an agent does (it cannot 
be an action).” But to see how question-begging these assumptions 
are, one has only to note that (A) and (B) imply respectively (A’) “If 
an event is a choice, it must be determined” (“All choices are deter- 
mined”) and (B’) “If an event is an action, it must be determined” 
(“All actions are determined”). Are these supposed to be a priori or 
analytic truths? If so, then long-standing issues about freedom and de- 
terminism would be settled by fiat. If an event were not determined, 
it could not be a choice or action necessarily or by definition.™ 

This explains the businesswoman’s suspicions when she exited the 
neuroscientists’ offices. They told her that when she “chose” to go 
back to help the assault victim the day before, there was some inde- 
terminism in her neural processes prior to choice. She accepted this 
as a correct empirical finding. But she was suspicious when the neu- 
roscientists tried to get her to make the further inference from those 
findings which she did not really choose to help the assault victim yes- 
terday. She refused to accept that conclusion, and rightly so. For in 
drawing it, they were going beyond their empirical findings and try 
ing to foist on her the a priori assumption that if an event was unde- 
termined, it could not have been her choice or could not have been 
something she did. She rightly saw that there was nothing in the em- 
pirical evidence that required her to say that. To choose is con- 
sciously and deliberately to form an intention to do something; and 
she did that, despite the indeterminism in her neural processes (as 
did businesswoman* when she chose to go on to her meeting). 

VL FINAL CONSIDERATIONS: CONTROL AND EXPLANATION 


But it is one thing to say that she chose and another to say she chose 
freely and responsibly. This would require that she not only chose, but 
had voluntary control over her choice either way. We have not talked 
at length to this point about the matter of control (considerations (1) 


™ See The Significance of Pree Will, pp. 171-74. 
™ Ild, pp. 188-86, for a fuller account of why indeterminism does not rule out 
action or choice. 


RESPONSIBILITY, LUCK, AND CHANCE 237 


and (3) of section 1) and must now do so. For this may be the reason 
why we may think the choices made by the businesswoman and busi 
nesswoman* (or John and John*) could not be responsible, if they 
were undetermined. We might deny that they had voluntary control 
over what they chose, where voluntary control means being able to 
bring about something in accordance with one’s will or purposes (or, 
as we often say, the ability to bring something about “at will”). 

One thing does seem to be true about control which critics of inde- 
terminist freedom have always maintained: indeterminism, wherever 
it appears, does seem to diminish rather than enhance agents’ volun 
tary control (consideration (3) of section n). The assassin’s voluntary 
control over whether or not the prime minister is killed (his ability to 
realize his purpose or what he is trying to do) is diminished by the 
undetermined impulses in his arm—and so also for the husband and 
his breaking the table. Moreover, this limitation is connected to an 
other, which I think we must also grant—that indeterminism, wher- 
ever it occurs, functions as a Aindrance or obstacle to our purposes that 
must be overcome by effort (consideration (3)). 

But recall that in the businesswoman’s case (and for SFAs gener- 
ally, like John’s), the indeterminism that is admittedly diminishing 
her ability to overcome selfish temptation, and is indeed a hindrance 
to her doing so, is coming from her own will—from her desire and e£ 
fort to do the opposite—since she is simultaneously trying to realize 
two conflicting purposes at once. Similarly, her ability to overcome 
moral qualms is diminished by the fact that she also simultaneously 
wants and is trying to act on moral reasons. If we could look at each 
of the two competing neural networks involved separately, abstracting 
from the other, the situation would look analogous to the situations 
of the husband and the assassin. The agent would be trying to do 
something while being hindered by indeterminism coming from an 
external source. But, in fact, we cannot look at the two networks sepa- 
rately in this way because, in reality, they are connected and interact- 
ing. The indeterminism that is a hindrance to her fulfilling one is 
coming from its interactions with the other. The indeterminism, 
therefore, does not have an external source. It is internal to her will, 
and hence to her self, since she identifies with both networks and will 
identify with the choice reached by either of them as her choice. 

The upshot is that, despite the businesswoman’s diminished con- 
trol over each option considered separately, due to a conflict in her 
will, she nonetheless has what I call plural voluntary control over the 
two options considered as a set (1hid., pp. 13443). Having plural vol 
untary control over a set of options means being able to bring about 
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whichever of the options you will or most want, when you will to do so, 
for the reasons you will to do so, without being coerced or compelled 
in doing so. And the businesswoman (or John) has this power, be- 
cause whichever of the options she chooses (to help the victim or go 
on to her meeting) will be endorsed by her as what she wills or most 
wants to do at the moment when she chooses it (though not necessar- 
ily beforehand); she will choose it for the reasons she most wants to 
act on then and there (moral or selfish reasons, as the case may be); 
she need not have been coerced by anyone else into choosing one 
rather than the other; and she will not be choosing either compul 
sively, since neither choice is such that she could not have chosen it 
then and there, even if she most wanted to.” 

One must add, of course, that such plural voluntary control is not 
the same as what may be called antecedent determining controt—the abil 
ity to determine or guarantee which of a set of options will occur before 
it occurs (ihid., p. 144). With respect to undetermined self-forming 
choices (SFAs), agents cannot determine or guarantee which choice 
outcome will occur beforehand, for that could only be done by predeter- 
mining the outcome. But it does not follow that, because one cannot 
determine which of a set of outcomes will occur before it occurs, one 
does not determine which of them occurs when it occurs. When the 
conditions of plural voluntary control are satisfied, agents exercise 
control over their present and future lives then and there by deciding. 

But can we not at least say that, if indeterminism is involved, then 
which option is chosen is “arbitrary”? I grant that there is a sense in 
which this is true. An ultimate arbitrariness remains in all undeter- 
mined SFAs because there cannot, in principle, be sufficient or over- 
riding pror reasons for making one set of competing reasons prevail 
over the other. But I argue that such arbitrariness relative to prior 
reasons tells us something important about free will. It tells us, as I 
have elsewhere expressed it, that every undetermined self-forming 
choice (SFA) “is the initiation of a ‘value experiment’ whose justifi- 
cation lies in the future and is not fully explained by the past [Mak- 
ing such a choice], we say in effect, ‘Let’s try this. It is not required 
by my past, but is consistent with my past and is one branching path- 
way my life could now meaningfully take. I am willing to take respon- 
sibility for it one way or the other’ * (1hid,, pp. 145-46). To initiate 
and take responsibility for such value experiments whose justifica- 
tion lies in the future, is to “take chances” without prior guarantees 
of success. Genuine self-formation requires this sort of risk-taking 
and indeterminism is a part of it. If there are persons who need to 


= Ibid, pp. 183-38, where a more detailed case is made for each of these claims. 
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be certain in advance just exactly what is the best or night thing to do 
in every circumstance (perhaps to be told so by some human or di 
vine authority), then free will is not for them. 

This point also throws light on why the luck argument fails, even in 
the stronger LP* version, despite its initial plausibility. Consider the 
move from step (c)—-the agents Pand P* have the same powers, char- 
acters, motives, and the like, prior to tin the two possible worlds—to 
step (d), which says it was a matter of luck or chance that Pdid A and 
P* did Bat & An important reason given for this move was that, if 
both agents have all the same prior powers, characters, motives, and 
the like, there can be no “explanation of the difference in choice” be- 
tween the two agents in terms of their prior reasons or motives; and 
this is taken to imply that the difference in choices in the two worlds is 
a matter of luck or chance im a way that precludes responsibility. 

But this move, like others discussed earlier, is too hasty. The ab- 
sence of an explanation of the difference in choice in terms of prior 
reasons does not have the tight connection to issues of responsibility 
one might initially credit it with. For one thing, the absence of such 
an explanation does not imply (as I have been arguing throughout 
this paper) that businesswoman and businesswoman* (John and 
John*) (1) did not choose at all, nor does it imply that they did not 
both choose (2) as a result of their efforts, nor that they did not choose 
(3) for reasons (different reasons, of course) that (4) they most 
wanted to choose for when they chose, nor that they did not choose 
for those reasons (5) knowingly and (6) on purpose when they chose, 
and hence (7) rationali, (8) voluntarily, and (9) intentionally. None of 
these conditions is precluded by the absence of an explanation of 
the difference of choice in terms of prior reasons. Yet these are pre- 
cisely the kinds of conditions we look for when deciding whether or 
not persons are responsible. 

I suggest that the reason why these conditions are not excluded is 
that the explanation of the difference of choice in the two possible 
worlds which is missing is an explanation in terms of sufficient or con 
clustve reasons—one that would render an alternative choice, given 
the same prior reasons, irrational or inexplicable. And, of course, 
thai sort of explanation is not possible for undetermined SFAs, when 
there is conflict in the will and the agent has good (but not decisive 
or conclusive) prior reasons for going either way. But neither is that 
sort of explanation required to say that an agent acts as the result of 
her effort for reasons she most wants to act on then and there. In 
sum, you can choose responsibly for prior reasons that were not conclusive or 
decistve prior to your choostng for them. 
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I said a moment ago that such arbitrariness relative to prior rea- 
sons tells us something important about free will—that every self- 
forming choice is the initiation of a value experiment whose 
justification lies in the future and cannot be fully explained by the 
past. It is worth adding in this regard that the term ‘arbitrary’ comes 
from the Latin arbitrium, which means ‘judgment’—as in liberum arbi- 
trium voluntatis (“free judgment of the will”)—the medieval designa- 
tion for free will. Imagine a writer in the middle of a novel. The 
novel’s heroine faces a crisis and the writer has not yet developed 
her character in sufficient detail to say exactly how she will react. 
The author must make a “judgment” (arbitrium) about how she will 
react that is not determined by the heroine’s already formed past, 
which does not give unique direction. In this sense, the author’s 
judgment of how she will act is “arbitrary,” but not entirely so. It has 
input from the heroine’s fictional past and, in turn, gives input to 
her projected future. 

In a similar manner, agents who exercise free will are both authors 
of, and characters in, their own stories at once. By virtue of “self- 
forming” judgments of the will (arbitria voluntatis), they are “arbiters” 
of their own lives, taking responsibility for “making themselves” out 
of past that, if they are truly free, does not limit their future pathways 
to one. If someone should charge them with not having a sufficient 
or conclusive prior reason for choosing as they did, they may reply as 
follows: “Perhaps so. But that does not mean I did not choose and it 
does not mean I did not choose for good reasons, which I stand by 
and for which I take responsibility. If I lacked sufficient or conclusive 
prior reasons, that is because, like the heroine of the novel, I was not 
a fully formed person before I chose—and still am not, for that mat- 
ter.” Like the author of the novel, I am in the process of writing a 
story and forming a person (who, in my case, is myself). It is a heavy 
burden, but an eminently human one.” 

ROBERT KANE 
University of Texas/ Austin 


= Jan Branson (in “Alternatives of Oneself,” Philosophy and Phenomenological Re- 
search (forthcoming)) has made an important distnction that is relevant here—be- 
tween chooang “alternatrves for oneself” and choosing “alternatives of oneself” 
Branson notes that some choices in life are for different courses of action that will 
make a difference in what sort of person the chooser will become in future. In 
such cases, agents are not merely choosing alternatives for themselves but are 
choosing alternatives of themselves. Many SFAs, as I understand them, would be of 
this kind. 
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INTENTION AND WEAKNESS OF WILL* 


ck is something curious about the philosophical literature 
on weakness of will. It is not about what one might expect. 
Even David Wiggins,’ in a discussion that has much in con- 
mon with that to be grven here, starts out by claiming: 


Almost anyone not under the influence of theory will say that, when a 
person is weak-willed, he intentionally chooses that which he knows or 
believes to be the worse course of action when he could choose the bet- 
ter course (thid., p. 239). 


I do not agree that this is the untutored view. Whenever I have 
asked nonphilosophers what they take weakness of will to consist 
in, they have made no mention of judgments about the better or 
worse course of action. Rather, they have said things like this: 
weak-willed people are irresolute; they do not persist in their in- 
tentions; they are too easily deflected from the path that they have 
chosen. My aim here is to pursue this line of thought. I shall de- 
velop the idea that the central cases of weakness of will are best 
characterized not as cases in which people act against their better 
judgment, but as cases in which they fail to act on their intentions. 
Of course, to say this is not to say enough. Not every case of a faib 
ure to act on one’s intentions is a case of weakness of will. Some- 
times, we realize that our intentions were ill-judged, or that 
circumstances have changed to make them inappropriate. When, 
in such cases, we fail to act on our intentions, we need not display 
weakness of will. So a large part of the discussion below will be 
concerned with saying what kind of failure to act on one’s inten- 
tion does constitute weakness of will. The basic idea is this. Just as 
it takes skill to form and employ beliefs, so it takes skill to form 
and employ intentions. In particular, if intentions are to fulfil 
their function, they need to be relatively resistant to reconsidere 
tion. Weakness of will arises, I shall suggest, when agents are too 
ready to reconsider their intentions. 


* This paper was written while I was a fellow at the Research School of Social 
Sciences, Australian National University. Thanks to audiences there, and at 
Monash University, Sydney Univermty, and Sheffield University for discussion; and 
especially to Sam Guttenplan, Lloyd Humberstone, Rae Langton, David Lewis, 
Philp Pettit, and Michael Smith. 

1 “Weakness of Will, Commensurability, and the Objects of Deliberation and De- 
sire,” in his Needs, Values, Truth (Cambridge. Basil Blackwell, 1987), pp 239-67. 
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In taking this approach, I depart from almost all of the literature 
on the subject.” Since I find the approach obvious, this calls for some 
explanation. I suspect that there are two factors at work. First, there 
has been, until quite recently, a widespread suspicion of the notion 
of intention. Some have said that intentions are a philosopher’s in- 
vention, a misleading reification of the legitimate talk of intentional 
action. Others, while prepared to concede a place to intentions, 
have held them to be reducible to beliefs and desires, or else to be- 
liefs and desires and their relations to action. Such considerations 
have led many philosophers to think that an account of weakness of 
will must ultimately be couched in the language of beliefs and de- 
sires. The traditional understanding of weakness of will as failure to 
do what one judges to be best is couched in this language; so, too, 
are the main rival accounts.’ In the last few years, however, a number 
of philosophers have argued that talk of intentions is both legitimate 
and irreducible: I have in mind especially the work of Michael Brat- 
man and Gilbert Harman.‘ If they are right that such talk is legit 
mate, then there is no requirement to analyze weakness of will in terms 
of belief and desire rather than in terms of intention. That is all that 


* Exceptons: Wigmns (atd.) goes on to sketch an account of weakness of will 
that does te it to failure to persist in one’s intentions. Alfred Mele—Arattonahty 
(New York: Oxford, 1987)—discusses cases of weakness of will in which one acts 
against one’s intention, but he still treats this as a special (and specially problem- 
atic) case of action against one’s best judgment, that is, these are cases in which 
one judges a particular course as right, and then forms the intention to pursue it, 
and then fails to act on it. In contrast, I suggest that action against one’s judgments 
about what 1s nght need not enter into examples of weakness of will at all. Amelie 
Rorty insists in various articles that weakness of will should not be identified with 
akrasia, she also charts a number of places at which the “akratic break” can take 
place, including at the point between forming an intention and acting on 1t— 
“Where Does the Akratic Break Take Placer” Australanan fournal of Philosophy, LVOI 
(1980): 333-46. I discuss some of the issues raised by her paper below Christopher 
Peacocke argues that an account of intention is essential to an account of akrana— 
“Intention and Akrana,” in Bruce Vermazen and Merril B. Hintkka, eds., Essays on 
Davidson: Action and Events (New York. Oxford, 1985), pp. 51-73; but his project is 
very different from that offered here. Roughly, he is concerned with the idea that 
akratıc action results from formmmg an misnton (instead of simply performing an ac- 
tion) that is not in line with one’s judgment as to what is best. 

> Namely, accounts that analyze weakness of will in terms of a failure to bnng 
one’s lower-order desires into line with one’s higher order desires, as presented in 
John Bigelow, Susan Dodds, and Robert Pargetter, “Temptation and the Will,” 
American Philosophical Quarter, xxv (1990): 39-49, following Richard C. Jeffrey, 
“Preference among Preferences,” this JOURNAL, LXXI, 13 (J 18, 1974): 377-91; 
and accounts that analyze weakness of will in terms of desire revision that does not 
result from conditonalization, as presented in Frank Jackson, “Weakness of Will,” 
Mend, xcm (1984): 1-18. 

1 Bratman, Intention, Plans, and Practical Reason (Cambridge: Harvard, 1987); 
Harman, Change m View (Cambridge: MIT, 1986). 
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is needed for the account offered here. If, in addition, they are right 
that talk of intentions is irreducible, then it will not be possible to 
rephrase the account offered here in terms of beliefs and desires. 
This, however, is an issue that I shall not broach; although I am sym- 
pathetic to the irreducibility claim, I shall not presuppose it in what 
follows. 

The second factor which has influenced the lines of the discussion 
is the impetus given by Plato and Aristotle. Both clearly were con- 
cerned with the question of whether it is possible to act against one’s 
best judgment; and much of the contemporary literature has been 
concerned with assessing and responding to their arguments. Let us 
grant the term ‘akrasta’ to refer to action voluntarily undertaken 
against one’s best judgment; it is, after all, scarcely a term of ordi- 
nary English. Then my contention will be that weakness of will is not 
akrasta. 

The outline of my discussion is as follows. In section I, I give an ac- 
count of intentions, relying largely on Bratman’s work, but going be- 
yond him in some areas. I show in section 0 how this leads naturally 
to an account of weakness of will. In section m, I chart the consider- 
able advantages that this account has over the traditional account 
that identifies weakness of will with akrasia. I reply to some possible 


objections in section Iv. 
I INTENTIONS 


Bratman’s account of intentions has three parts. He identifies the 
characteristics that a state needs to have if we are to count it as an 
intention; he explains the useful role that states with these charac- 
teristics would have in our lives; and he gives reasons for thinking 
that beliefs and desires on their own could not fulfill this role. 
Taken together, these provide the basis for the case that we should 
acknowledge the existence of intentions. For if we seem to have 
them, and if having them would play a useful role, and if indeed 
they do seem to play this useful role, then the burden of proof rests 
in their favor. We shall need compelling arguments before giving 
them up. 

J shall not be concerned here with the third element of Bratman’s 
argument: the contention that beliefs and desires cannot play the 
role allotted to intentions. Readers in search of that can read Brat- 
man’s book. But the other two elements of his argument—the ac- 
count of the distinguishing marks of intentions, and of the useful 
role that intentions can play—will be central to my account of weak- 
ness of will. Let me start by identifying the distinguishing marks; in 
particular, let me start with the distinguishing marks of future-directed 
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intentions (that is, intentions to do something at some future time). 
I shall return to consider present-directed intentions later. 

A future-directed intention to perform a certain action typically 
has the following characteristics. The agent forms the intention at 
one time by making a decision to perform the action. Then, unless it 
is revised, the intention will directly lead the agent to perform the 
action; it is, as Bratman says, controlling (op. cit., p. 16). Moreover, it is 
relatively immune to reconsideration and hence to revision. Once 
formed, intentions have a tendency to persist. They have what Brat- 
man Calls stability (op. at. pp. 16, 65). 

What need do we have of states that are stable and controlling in 
this way? Part of Bratman’s answer stems from the observation that 
we are epistemically limited creatures. Information is scarce and 
costly to obtain. Reasoning on the information that we have takes 
time and effort. It is rational then to limit the amount of time we 
spend looking for information and reasoning on it. We should doa 
certain amount of searching and reasoning, and then stop and make 
a decision on the basis of that. Now, if we were to act immediately on 
making the decision, then this would not in itself give rise to a need 
for intentions. But suppose our decision is a decision to act tomor- 
row. Then we need some way of storing our decision so that we act 
on it tomorrow without reconsidering it (for to reconsider it would 
violate the requirement that we not reason about it further). In 
short, we need a state that is stable and controlling: an intention to 
perform the action that is relatively immune to revision, and will in 
itself be enough to move us when the time comes. 

This, in turn, raises another question: Why should we want to 
make a decision now about how we shall act tomorrow’? It could be 
that now is a more propitious moment to reason. Tomorrow, there 
will be some factor that will stop us from reasoning well: we shall be 
short of time, or distracted, or some such. But a more common rea- 
son for doing our reasoning ahead of time is that all sorts of other 
actions will be dependent upon what we decide to do; and we shall 
need to perform some of these actions in the meanwhile. Thus, sup- 
pose I want to paint my front door tomorrow. Then there might be 
many reasons for deciding today what color to paint it. It could be 
that tomorrow I shall start at the crack of dawn when there will be 
too little light to see the color charts properly, that will give me a rea- 
son of the first kind for doing my deliberation today. More probably, 
the paint shop will be closed tomorrow, so I should buy the paint to- 
day; and since I want only to buy the color that I shall use, I shall 
need to have decided today what color that will be. I need to form 
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my intention ahead of time. Thus, intentions are necessary in the im 
trapersonal case for agents to coordinate their activities. They are also 
important in the interpersonal case. Suppose some friends are coming 
to visit me tomorrow, and the easiest way for them to recognize my 
house is from the color of the front door. I shall be speaking with 
them today, but not again before they visit. Then, once again, I have 
a reason for forming today an intention about which color I shall 
paint my door tomorrow, so that I can tell them what to look for. 

It will be important to my argument to realize that there can be 
good reason for forming intentions now, even in cases where I can- 
not make my decision just on the basis of rational deliberation. One 
such case involves Buridan-style examples: if I am really indifferent 
between red and green for my door, it is nonetheless important that 
I decide on one of them, and form the corresponding intention. 
Otherwise, I shall not know what paint to buy, or what to tell my vist 
tors. A more interesting case concerns options that the agent finds 
incommensurable. Suppose (taking an example at random) that I 
do not know how to compare the demands of leaving home to fight 
fascism with the demands of staying to look after my mother. Here, 
the problem is not that each outcome fares equally well on the same 
scale; I do not know how to place them on the same scale. Neverthe- 
less, I need to decide on one course, and to form the corresponding 
intention; both intrapersonal and interpersonal coordination re- 
quire this. I need to know whether to start winding up my affairs and 
packing my bag; and I need to know what to tell my mother so that 
she too can make her plans accordingly.’ 

I have sketched the main lines of Bratman’s argument for the utility 
of intentions. Once we have got this far, it is easy to see that they might 
be useful in other roles, too, roles that Bratman does not discuss. Sup- 
pose that there is an act that I now believe I should perform at some 
time in the future; and suppose I know that when the time comes, I 
shall not want to perform it. Then it would be useful to form an inter 
tion now, an intention which will lead me directly to act when the time 
comes, and which will be somewhat immune to reconsideration in the 
light of the desires I shall have then. Philosophers have been keen on 


* Bratman points out that cases like these show the utility of ntentions even in 
cases where information is not limited; for we have reason to think that Buridan 
cases will arise even for agents who know everything. He also argues, I think con- 
vincingly, that they show that we cannot reduce talk of intentions to talk of beliefs 
and demres. By hypothesis, I do not desire that I take one course rather than the 
other; and my intention to take one course cannot be seen as a belief that I will 
take it See Bratman, “Davidson's Theory of Intention,” in Essays on Davidson: Ac- 
tons and Events, pp. 13-26, here pp. 22-23; Intention, Plans, and Practscal Reason, pp. 
11, 22ff. 
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the idea that sometimes an intention will not be enough; that we will 
need to bind ourselves in some further way to fulfil our intentions, 
perhaps by placing temptation out of our reach, or by telling others of 
our plans so that fear of their disapproval should we fall short will pro- 
vide an extra incentive.® But we should not let our interest in these ex- 
otic methods blind us to the fact that very often, intention is enough. 
People get up on cold dark mornings, leave enjoyable lunches to re- 
turn to work, even give up smoking, using no mechanism other than 
an intention to overcome their contrary desires.’ 

The use of intention to overcome present desire becomes espe- 
cially important in cases that involve repeated actions. | reason that 
since smoking forty cigarettes a day for the rest of my life will make a 
considerable difference to my chance of getting lung cancer, I 
should give it up. But should I deny myself the cigarette I was about 
to have? Smoking one cigarette will make virtually no difference to 
my chances of getting lung cancer. So why should I deny myself? Un- 
fortunately, the same argument will work just as well forty times a day 
for the rest of my life. What we need here is not just an intention to 
perform—or, in this case, to refrain from performing—a specific ac- 
tion. We need a general intention concerning a certain type of ac- 
tion; what Bratman calls a policy,’ and I shall call a polscy intention. 

To conclude this section, let us return to an issue shelved at its be- 
ginning: Are there such things as preseni-directed intentions? All the 
considerations raised so far have concerned future-directed inten- 
tions, that is, intentions to do something in the future. Do they give 
us any reason for accepting intentions to perform actions straight 
away? I rather doubt that they do; however, there is a slight theorett 
cal elegance in accepting present-directed intentions. I spoke of 
future-directed intentions as resulting from a decision to act. I do not 
mean by this that there are two distinct states: the decision and the in- 
tention. Rather, to form the intention just is to make the decision. 


t Jon Elster, Ulysses and the Sirens (New York: Cambridge, 1979). I suspect that 
very often the point of telling others of our intentions 1s not to incur their scorn if 
we revise them. Rather, ıt 1s simply to remind us of what our intentions were, or of 
how seriously we made them. A midwife told me of the entry on one woman’s birth 
plan: “If I ask for pain relief ın labor, show me this birth plan.” The woman did ask 
for pain relief, the midwife showed her the birth plan, and the woman decided 
against taking any. 

” Bratman briefly mentions examples in which intentions are used to overcome 
future resistance, but argues that we get a better understanding of the central cases 
of intention by ignonng them—J/niention, Plans, and Practical Reason, p. 12. I sus 
pect that such examples do provide central cases of intention; what we do well to 
ignore are examples in which intention 15 not enough. 

1! Intention, Plans, and Practical Reason, pp. 87-91. 


INTENTION AND WEAKNESS OF WILL 247 


(This is not to say that you cannot hawe an intention without having 
made a decision. You might have been born with the intention; or it 
might have been implanted by some manipulator using fiendish 
methods. But if so, you did not form the intention.) Then we might 
want to say that a decision to act now also results in an intention, but 
a present-directed one. A number of theorists have taken this path.’ 
It seems to me to be slightly at odds with ordinary usage, but innocu- 
ously so; and it will make my account simpler if I adopt it here. So 
when I speak of intentions, these can be either present- or future- 
directed. 
Il. WEAKNESS OF WILL 

I have argued that we have a need for intentions, that 1s, that we 
have a need for action-guiding states that are not readily revised. My 
suggestion now is that this gives the basis for an account of weakness 
of will. A person exhibits weakness of will when she revises her inten- 
tion, in circumstances in which she should not have revised it. This 
‘should’ is not meant in a moral sense. Rather, it is the ‘should’ that 
is generated by the norms of the skill of managing one’s intentions. 
A person is weak willed if she revises her intentions too readily. 

What is it to be too ready to revise an intention? Clearly, not every 
case of a revision is a case of an over-ready revision. If you and I in- 
tend to go for a picnic, and it starts to rain, we might be quite right 
to revise our intention, and in doing so we need not show weakness 
of will. (I only say we ‘need not’ show weakness of will in such a case, 
for I have left it underdescribed; if we had vowed to go whatever the 
weather, then we probably would have been weak willed in giving up 
when it rained.) It might be thought that we can accommodate cases 
like these by invoking conditional intentions. Even if we did not say 
so explicitly, it might be thought that our intention was the condi 
tional one of going for a picnic provided it did not rain. But not 
every reasonable revision of an intention can be understood as the 
triggering of an escape clause in a conditional intention. We would 
be equally right to abandon our intention in the face of a freak 
plague of frogs falling from the sky; but neither of us considered 
that, even implicitly. We need, then, some general account of when 
it is right to revise our intentions. 

Friedrich Nietzsche” seems to have held that it is always wrong to 
revise an intention: 


” See, for instance, the notion of proximal intentions in Mele, Springs of Adion 
(New York: Oxford, 1992), part I. 

H Beyond Good and Evu (Harmondsworth: Penguin, 1973); ated also in 
Watson, “Skepticism about Weakness of Will,” Philosophical Renew, Uxxxvi (1977): 
316-39, here p. 328. 
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To close your ears to even the best arguments once the decision has 
been made: sign of a strong character. Thus an occasional will to stupid- 
ity (wid. 8107). 


Perhaps this is good advice for football referees. But as a general atti 
tude it is surely a recipe for disaster. Moreover, it is not even true 
that such intransigence is a sign of a strong character. One is re- 
minded of Somerset Maugham’s"' jibe against the Vicar of Black- 
stable: “Like all weak men he laid an exaggerated stress on not 
changing one’s mind” (ibid., chapter 39). Nevertheless, if our ac- 
count of intentions is correct, there is something right about the 
idea that there should be a real reluctance to revise an intention. To 
see this, let us return once more to Bratman, and to his account of 
the circumstances in which we rationally should reconsider our in- 
tentions. His idea is that it is rational to reconsider an intention just 
in case doing so manifests tendencies that it is reasonable for the 
agent to have; similarly, it is rational to fail to reconsider an inten- 
tion just in case this manifests tendencies that it is reasonable for the 
agent to have.’? This means, of course, that sometimes it will be ræ 
tional not to reconsider an intention even when reconsideration 
would, in that instance, have been in the agent’s interests. In gem 
eral, once we have decided in which restaurant to eat, it is a good 
idea to let the matter rest, without endlessly discussing the pros and 
cons; and this is true, even though it occasionally means that we shall 
go to a restaurant that we could have realized was not the best 
choice. 

Presumably, the converse of Bratman’s criterion also holds: it is 
not rational to reconsider something if the reconsideration mani- 
fests tendencies that it is not reasonable for the agent to have. 
Adding this to the account proposed here, we arrive at the following: 
actors show weakness of will when they revise an intention as a result 
of a reconsideration that they should not have performed; that is, 
when their reconsideration exhibits tendencies that it is not reason- 
able for the agent to have. 


4 Of Human Bondags(New York: Doran, 1915). Note that Maugham does not as 
cribe weakness of will to the vicar. There is more than one kind of weakness; what 
is at issue here is more like weakness of character. For a further distinction, con- 
sider J. L. Austin’s complaint about the “grotesque confusion of moral weakness 
with weakness of will"—"A Plea for Excuses,” m his Philosophical Papers (New York: 
Oxford, 1961), pp. 123-52, here p. 146, note 1. 

= Bratman, Intention, Plans, and Practical Reason, p. 68. He actually talks in terms 
of habits rather than tendencies. But for something to be a habit, it must be habit- 
ually exhibited. In contrast, it is possible for a tendency to be only rarely exhibited, 
if at all I want the latter notion. 
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As it stands, this will seem horribly vague. Which tendencies of re- 
consideration is it reasonable for an agent to have? I would suggest 
rules of thumb like the following: 


(1) It is reasonable to have a tendency to reconsider intentions, if one 
believes that circumstances have changed in such a way that they de- 
feat the purpose of having the intention. 

(2) It ıs reasonable to have a tendency to reconsider intentions, if one 
believes that they can no longer be carried out. 

(3) It is reasonable to have a tendency to reconsider intentions, if one 
believes that they will lead one to great suffering when that suffer- 
ing was not envisaged at the time of forming the intention. 

(4) It is reasonable to have a tendency not to reconsider intentions in 
circumstances that prevent clear thought, if those intentions were 
made in circumstances that allow clear thought. 

(5) It is reasonable to have a tendency not to reconsider intentions that 
were expressly made in order to get over one’s later reluctance to 
act. 


This is still pretty vague: the principles themselves are vague, they 
are doubtless incomplete, and I have said nothing about which of 
them should take precedence over which. But I shall not try to elim} 
nate this vagueness. Vagueness is a problem when we try to deter- 
mine the extension of a concept when we try to provide a means of 
telling for each particular act whether it does or does not display 
weakness of will. My concern here is rather with giving an analysis of 
the concept itself. If the concept is vague, then the analysis had bet- 
ter be vague, too, and along just the same dimensions. I hope that 
we shall see that the account proposed here provides just the right 
sort of vagueness. 

The account is not quite plausible as it stands. I need to make one 
clarification and one amendment First, the clarification. Suppose I 
intend to perform a rash and dangerous act: leaping from a high 
cliff on an untested homemade hang-glider. Would I show weakness 
of will in revising my intention? That rather depends on why I revise 
it. If I reassess the chances that the hang-glider will fail, and decide 
on a program of more cautious preliminary testing, that is surely a 
reasonable revision. If, on the other hand, I simply suffer a failure of 


3 Perhaps this rule is otiose. Perhaps it is a necessary condition on the fonnaiton 
of an intention that one does not believe that one will be unsuccessful in carrying 
it out. Or, more controversially, perhaps one must actually believe that one will be 
successful. (For discussion and references, see Bratman, Intention, Plans, and Pract 
cal Reason, pp. 35-41; he denies that these are necessary conditions.) Equally then, 
perhaps it is imposenbie to mariain an intention in the face of a new belief that 
one cannot carry it out, or even 1n face of the loss of the belief that one can. 
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nerve, backing off from the edge of the cliff at the sight of the drop 
before me, a failure I would have suffered no matter how good the 
hang-glider, then that is plausibly a case of weakness of will. So the 
question is not simply whether or not it is reasonable to reconsider 
my intention; it is whether or not the particular reconsideration I 
perform is one that exhibits tendencies that it is reasonable for me 
to have. 

Second, the amendment.* Some people go in for a lot of low-level 
intention revision. They decide to go to one restaurant, change their 
minds in favor of another, switch back to the first, again to the sec- 
ond, and so on. Now, it is not reasonable to have such tendencies; 
much time and effort is wasted, and interpersonal coordination be- 
comes a nightmare. But while we might describe such people as 
fickle or capricious, we would not normally describe them as weak 
willed. So the account offered so far is too broad. We need to know 
what is distinctive about weakness of will, what it is that distinguishes 
it from caprice. 

J mentioned before that one reason for forming an intention can 
be in an attempt to overcome contrary desires that one believes one 
will have when the time comes to act. An intention can be, as we 
might say, contrary inclination defeating. Now, this might be only part 
of the reason for forming a particular intention, and not a very im- 
portant part at that. Indeed, sometimes this aspect might be merely 
implicit in an intention, and might only emerge if we asked the per- 
son what they would do if they were to change their preferences. 
The notion is not vacuous, however, since some intentions clearly 
are not contrary inclination defeating. Thus, someone might form 
an intention that has the content: I will go to The Red Lion 
tonight, providing I still feel like going (and such an intention can 
still have an important role in coordination, despite its conditional 
nature). 

So we have a distinction between intentions which are designed to 
be contrary inclination defeating, and those which are not. This pro- 
vides us with the grounds that we need for distinguishing weakness 
of will from caprice. If someone overreadily revises an intention that 
is, at least partially, contrary inclination defeating, that is weakness of 
will; if they overreadily revise an intention that is not, that is caprice. 
Consider again the vacillating diner. Suppose he has become con- 
cerned about his tendency to keep changing his mind, and so re- 
solves to go to a particular restaurant even if another seems more 


M Jeanette Kennett showed me that I needed to make an amendment here. 
Lloyd Humberstone showed me how to make it 
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attractive later on. In other words, suppose he forms a contrary 
inclination-defeating intention to go to that restaurant. Then if he 
revises his intention once again, he would not merely be capricious; 
he would display weakness of will. 

Thus, we can think in terms of a genus with two species. The 
genus is irresoluteness: unreasonable revision of intention. The two 
species, weakness of will and caprice, are distinguished by whether 
the intentions revised are contrary-inclination-defeating intentions 
or not. I am not sure how theoretically important this distinction is; 
moreover, we have now introduced further vagueness at the bound- 
ary between the two species. But the distinction, and the vagueness 
at the boundary, does, I think, correspond to our ordinary usage. 
Since this is a paper on weakness of will, I shall limit discussion to in- 
tentions that are contrary inclination defeating; but since that is 


rather a mouthful, I shall just call them intentions. 
HL ADVANTAGES OF THE PRESENT PROPOSAL 


I shall present six reasons for believing the account of weakness of 
will which is offered here; or, to be more precise, six reasons for pre- 
ferring it to the traditional account that understands weakness of will 
as a failure to do what one judges to be best.” 

(1) Accommodating weakness of will tn cases of indifference or mcommensura- 
itty. | mentioned before that we form intentions in cases where we are 
indifferent between the alternatives, or treat them as incommensu- 
rable. When we do so, and revise our intentions, we can show weakness 
of will. That is just what the account proposed here would predict. But 
the traditional account can make no sense of such cases; for, by hypoth- 
esis, the agents involved do not judge one option to be better than the 
other, and so cannot be acting against their better judgment. 

To see the possibility of such cases, consider again the example of 
incommensurability mentioned above. I am caught between the con- 
flicing demands of fighting fascism and of staying to look after my 
mother. Unable to compare them in any meaningful way yet know- 
ing I must choose, I decide to go and fight. But, then, when the mo- 
ment to leave comes, I revise my intention and stay with my mother. 
That can be an example of weakness of will. (Note that to generate 
such a case we need not say that the options really are incommensu- 
rable; only that the agent judges them to be so.) 

(ï) Explaining the relation of weakness of will to strength of will. One 
would expect the property opposed to weakness of will to be 


= Some of these considerations also, I beleve, provide reasons for preferring it 
to the accounts offered by Bigelow, Dodds, and Pargetter, and by Jackson; but I 
shall not argue for this contention here 
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strength of will. And strength of will is something that shades off into 
stubbornness. The account I propose explains this in a straightfor- 
ward way. A person shows strength of will when they stick by their in- 
tentions in circumstances in which they are right to do so; that is, 
when they do not reconsider them, and not doing so exhibits ten- 
dencies that it is reasonable for them to have. Strength of will turns 
to stubbornness when they stick by their intentions even when it is 
reasonable to reconsider and revise them. And, of course, the 
boundary between these two is vague; reasonable people will dis- 
agree about which tendencies of intention revision it is reasonable to 
have, just as they will disagree on when strength of will moves over 
into stubbornness. (One way of understanding the quotation from 
Nietzsche is to see him placing the boundary at one extreme.) Here, 
then, is one place where the vagueness in the ana4sans mirrors the 
vagueness in the anahsandum. 

In contrast, the traditional account has real problems in explain- 
ing strength of will. Indeed, defenders of the traditional account 
have typically contrasted weakness of will with self-control, where this 
is understood as the ability to do what one believes to be best. But 
self-control is not continuous with stubbornness, in the way that 
strength of will is. It is perhaps possible to have too much selfcontrol; 
but the worry there is that one lacks spontaneity, and lacking spon- 
taneity is very different from being stubborn. 

(+) Accounting for cases of oscillating weakness of will. Consider this 
example from Thomas Schelling": 


As a boy I saw a movie about Admural Byrd’s Antarctic expedition and 
was impreased that as a boy he had gone outdoors in shirtsleeves to 
toughen himself up against the cold. I resolved to go to bed at night 
with one blanket too few. That decision to go to bed minus one blanket 
was made by a warm boy. Another boy awoke cold in the night, too cold 
to retrieve the blanket and resolving to restore it tomorrow. But the 
next bedtime it was the warm boy again, dreaming of Antarctica, who 
got to make the decision. And he always did it again (1nd, p. 59). 


Let us suppose that the cold Schelling had at least tried to retrieve 
the blanket. How would we describe the situation? Would we say that 
Schelling displayed weakness of will when he reached for the blanket 
despite his earlier resolution to leave it off? Or would we say that he 
displayed weakness of will in leaving the blanket off again the follow 
ing night, despite his resolution to leave it on? I think that we might 


$ “The Intimate Contest for SelfCommand,” in his Chows and Consequence 
(Cambridge: Harvard, 1984), pp. 57-82. 
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well want to say both (we would need to fill out the details in the 
right way; he must form each intention seriously, having given it 
proper consideration, and so on). The account I am offering enables 
us to do so. We say that in each case, in overturning his previous in- 
tention, Schelling displays tendencies that it is not reasonable for 
him to have. But on the traditional account of weakness of will, it is 
not easy to see how we can get this result. 

On the traditional account, the normal case is one in which the 
agent maintains a view about which action is the right one, and 
then acts contrary to this belief. But that cannot be the case here, 
since if the young Schelling maintained a view, then it would have 
been either that it is best to leave the blanket on, or that it is best to 
take it off. But if he maintained either of these views, then it is not 
possible for both actions to display weakness of will, since it is not 
possible for both actions to be contrary to the view maintained. Al- 
ternatively, proponents of the traditional account might think that 
the young Schelling changed his mind about which course was the 
best. Then, in order to characterize him as doubly weak willed, they 
would have to say that in the middle of the night, shivering beneath 
one blanket too few, he believed that what was best was to keep the, 
blanket off, and his weakness stemmed from reaching for it; and 
they would have to say that when going to bed, warm and dreaming 
of the Antarctic, he believed that it was best to keep the blanket on, 
and his weakness stemmed from his leaving it off. But that is com- 
pletely implausible; if there were any reason for thinking that, in 
the middle of the night, he believed it best to keep the blanket off, 
that would be because he believed this when he went to bed, and 
had not changed his mind; and similarly, if there were any reason 
for thinking that, at bedtime, he believed it best to keep the blanket 
on, that would be because he believed this in the middle of the 
night, and had not changed his mind. 

(tu) Allocating the stigma of weakness of will There is a considerable 
stigma attached to being weak willed. I think that the account of 
fered here correctly predicts the times that it is deserved, and the 
traditional account does not. I have a friend who believes that all the 
arguments point to the same conclusion: he should not eat meat. 
But he is not moved. “I am,” he says disarmingly, “inconsistent.” Now 
there is something very odd about that, but, although he is akrafic, 
he does not attract the stigma that attaches to weakness of will. In- 
deed, I would never call this friend weak willed. Suppose, however, 
that he announces that he has decided to give up meat; his intention 
is firm. If I then find him roasting himself a suckling pig in a pit, I 
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shall say that he is weak willed, and quite rightly. It is the failure to 
persist in the intention that makes all the difference. 

We can make the point in another way. Suppose my friend tells 
me that he will give up eating meat on the first day of January; it is 
his New Year’s resolution. I find him ruefully eating sausages on New 
Year’s Eve; I cannot scorn him for his weakness of will. Yet were I to 
find him eating the same meal the following day, I could. His views 
about what is best have not changed over the two days; the differ- 
ence stems from the intention. 

These considerations are relevant to one of the binding strategies 
that was mentioned before. If I fear that I shall not persist in my in- 
tention, then one thing I can do is to make it public; for the scorn 
that I shall suffer if I revise the intention will provide me with an in- 
centive to keep with it. The important point here is that, if I want to 
enlist the potential scorn of others in this way, it is not much use sim- 
ply announcing my opinion of what is best; I need to announce my 
intention. 

(v) Explaining cases of weakness of will without inner conflict. Some 
people say that in cases of weakness of will there must be an inner 
conflict: the agent must feel the pull of the course that they are 
weakly abandoning. And they take this as an argument for the tradi 
tional account.’” That strikes me as quite wrong. Of course, when we 
self-ascribe weakness of will, we shall typically feel some tension; oth- 
erwise, we would not know to self-ascribe it. But we can ascribe weak- 
ness of will to a third party in whom there is no conflict. We surely 
can ascribe weakness of will to a person who has vowed to give up 
smoking, and who blithely starts up again straight away, saying that 
they have changed their mind (again, we would need to fill in the 
details in the right way). We can make perfect sense of this on the 
view proposed here: we simply say that they gave up their intention 
too easily. We need to be sure that the initial intention really was 
formed, and too sudden a change might make us doubt that, and 
the revision itself might be seen as involving a kind of conflict. But 
there need be no internal conflict at all at the point at which they 
smoke the cigarette. 

Thus it seems that rather than be a point in the traditional ac- 
count’s favor, its requirement that there be internal conflict actually 
counts against it. For it is hard to see that the traditional account 
could count our recidivist smoker as weak willed unless there was, 
deep down and repressed, a belief that smoking was the wrong thing 


For instance, Christopher Cordner, “Jackson on Weakness of Will,” Mind, Xav 
(1985): 273-80 
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to do. Yet we do not need to attribute such self-deception in order to 
attribute weakness of will. 

Note that I have not said that cases of akrasia are impossible. Sup- 
pose that I were tempted by the arguments of Plato and Aristotle 
and their latter-day followers, arguments to the conclusion that a 
person cannot choose other than that which they judge best. Then I 
might try to exploit the account given here to say that what appears 
to be akrasia is really just an unreasonable revision of intention. But I 
am not tempted by those arguments. So I can happily accept that 
there are cases of akrasta, and that they do give rise to internal con- 
flict. I simply claim that unless they also involve an unreasonable re- 
vision of intention, they are not cases of weakness of will. 

(vi) Accommodating cases of both akrasia unthout weakness of will, and 
weakness of will without akrasia. So far we have considered cases of 
weakness of will without akrana, and we have considered cases of 
akrasia without weakness of will. There are some cases that display 
both. Consider the following three examples. 

(a) Ravi has devoted his life to his poetry and still considers it the 
most important thing in the world. But he has fallen in love with 
Una, an English schoolgirl. She finds herself pregnant, and, fearing 
that her father will force her into an abortion, they elope. Ravi vows 
that no one will harm the child. His commitment is short lived, how 
ever. They are found, and the police threaten him with prison. 
Faced with the choice between standing by Una and the unborn 
child, or leaving her to a forced abortion and going free to pursue 
his poetry once again, he shamefacedly chooses the latter.’ 

(b) Christabel, an unmarned Victorian lady, has decided to em 
bark on an affair that she knows will be disastrous. It will ruin her 
reputation, and quite probably leave her pregnant. Moreover, she 
considers it morally wrong. So she thinks it not the best option on e} 
ther moral or prudential grounds. Nevertheless, she has resolved to 
go ahead with it. At the very last moment, however, she pulls out: not 
because of a rational reconsideration of the pros and cons, but be- 
cause she simply loses her nerve.” 

(c) The President has his finger on the nuclear button and his 
threat is simple: if the enemy invades, he will retaliate in massive 


= Rumer Godden, The Peacock Spnng (London: Macmillan, 1975). 

* Thanks to Rae Langton for this example. It 1s adapted from the story of 
Christabel LaMotte ın A.S. Byatt’s novel Possesson (London: Chatto and Windus, 
1990)—-Byatt’s Christabel, however, does not lose her nerve. In thinking about the 
case, ıt is important to recall the distinction, made above in discussing the hang 
gliding example, between a revision that it is rational to make, and a revision that 
18 made rationally 
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fashion. That will be a catastrophe for everyone, for the President 
and his people as well as for the enemy. But he reasons that such a 
threat is needed to deter; and that in order for the threat to be 
credible it must be genuine: he must really form the conditional in- 
tention to retaliate if the enemy invades. Then the unthinkable hap- 
pens: the enemy invades. Enraged, the President prepares to 
retaliate. But then, faced with doing what he takes to be morally 
and prudentially wrong, he hesitates, revises his intention, and de- 
sists.™ 

All three examples have a similar form. At some point, each agent 
either forms, or finds themselves with, an intention that is in some 
sense akratic: an intention to do other than what they judge best. 
Ravi intends to stand by Una; Christabel intends to have the affair; 
the President intends to push the button. None at this point seems 
to show weakness of will, notwithstanding the gap between what they 
intend and what they judge best. So here we have akrasia without 
weakness of will; or at least, we would have if the agents had acted on 
those intentions. Then the agents revise their intentions, bringing 
them into line with what they judge best. Ravi now intends to aban- 
don Una and pursue his poetry; Christabel now intends to forego 
the affair; the President now intends to take his finger off the button 
without pushing it. But in each case, it is precisely the revision that 
leaves them open to the charge of weakness of will. So here we have 
weakness of will without akrasıa. 

For Ravi and Christabel, I think it is clear that revising the inten- 
tion constitutes weakness of will. There are, however, important dif- 
ferences between the two examples. Ravi has a guilty conscience 
about what he is doing; while he does judge it the best option, all 
things considered, he does not judge it to be the morally best op- 
tion. So might we try to save the traditional account by arguing that 
weakness of will consists in failing to do what one judges to be 
morally best? Christabel’s case shows us that this defense will not 
work. She does think that the best option, all things considered, is 
the morally best option, yet in revising her intention she, too, shows 
weakness of will.” 


» Tam grateful to David Lews for this example. I am not sure that he intended 
it for quite the use to which I have put it 

= Advocates of the traditional account have typically insisted that the judgment 
of what 1s best, all things considered, 1s not the same as the judgment of what is 
morally best; and that the account of weakness of will should be framed in terms of 
the former notion, not the latter. See Donald Davidson, “How Is Weakness of the 
Will Possible?” in his Essays on Actions and Events (New York: Oxford, 1980), pp. 21- 
42, here p. 30. 
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The case of the President is more complicated. It is not clear 
whether we should ascribe weakness of will to him; but that is be- 
cause it is unclear whether or not it is reasonable for him to have a 
tendency to change his mind in this case. Let us return to the point 
at which the President was still trying to make the deterrence policy 
work. Then, at least according to the logic of deterrence, it was rea 
sonable for him to form the intention to retaliate if the invasion hap- 
pened. At that point, we could surely have expressed a conviction 
that he would stick with this intention by describing him as strong 
willed; and, equally, we could have expressed a doubt that he would 
stick with it by describing him as weak willed. That is what the ac- 
count predicts. But once things have gone wrong, it is not obvious 
what he should reasonably do, even if we accept the logic of deter- 
rence. On the one hand, he reasonably judged that he should retal 
ate in case of invasion, and he is now in such a case; on the other, 
triggering an all-out nuclear war is madness. If we incline to the sec- 
ond of these considerations (as we surely should) then we should 
not describe him as weak willed if he revises his intention. But the 
memory of what we said before the invasion should make us uneasy 
with that judgment. In cases like this, we get ourselves into a very 
real and very dangerous muddle over what it is reasonable to do; our 
confusion over applying the notion of weakness of will reflects that 


muddle. 
IV. RESPONSES TO OBJECTIONS 


I have given my reasons in favor of the account that I propose. I shall 
finish by responding to five possible objections. 

(i) Why be so strict? Amelie Rorty, in her enlightening paper “Where 
does the Akratic Break Take Place?” answers her own question by 
providing a multitude of candidates (op. ai.).7 I have suggested 
that weakness of will consists in just one kind of failing. But 
why should I be so strict? The English language is a plastic in- 
strument. It seems overwhelmingly likely that sometimes peo- 
ple have used the expression ‘weakness of will’ to describe 
other failings, such as some of the others that Rorty mentions. 
In particular, is it not very likely that sometimes it has been 
used to describe the case in which a person fails to form an in- 
tention to do what he knows to be best, and as a result does 
what he knows not to be. In short, is it not very likely that the 
traditional account captures one of our uses of the expres- 
sion? 


= Note that Rorty 1s here giving a discussion of akrana, not of weakness of will; 
like me, she denies that the two are the same. See note on p. 333. 
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Perhaps that is right. If so, I should rest content with the claim 
that many cases of weakness of will are captured by the account pro- 
posed here; I should offer it as a supplement to the traditional ac- 
count, not as a replacement. But I cannot help thinking that the 
traditional account is not simply inadequate, but straight-out wrong. 
First, we have seen a number of cases in which people appear to be 
akrahe without being weak willed. Second, I doubt that there are any 
clear cases of weakness of will that can be captured by the traditional 
account and not by mine. 

Consider this example concerning Jim Dixon, hero of Kingsley 
Amis’s novel Lucky fim.” Dixon, a junior and untenured lecturer, is 
staying with Professor Welch, his head of department and arbiter of 
his future in the university. He awakens after a night of heavy drink- 
ing to discover cigarette burns through his bedclothes. 


Had he done all this himself? Or had a wayfarer, a burglar, camped out 
in his room? Or was he the victim of some Horla fond of tobacco? He 
thought that on the whole he must have done it himself, and wished he 
hadn't. Surely this would mean the loss of his job, especially if he failed 
to go to Mrs. Welch and confess what he'd done, and he knew already 
that he wouldn’t be able to do that (sid, p. 63). 


Would we not say that Dixon displays weakness of will here? Yet he 
never forms an intention to tell Mrs. Welch. So must we not accept 
that there can be weakness of will that does not consist of a failure to 
do what one intends? This seems like good evidence that the tradi- 
tional account is sometimes correct. But J am not so sure. Why does 
Dixon not form the intention to tell Mrs. Welch? Presumably, be- 
cause he knows that he would not be able to go through with it, if he 
did. So he knows that, if he did form such an intention, he would 
display weakness of will, where this is understood in terms of the 
present account. It is because he knows that he is someone with a 
tendency to weakness of will that he acts as he does. So, on the ac- 
count given here, his weakness of will explains his action (or rather 
his inaction). It seems to me that is good enough. Once we have said 
that we feel no compelling need to insist that Dixon actually exhibits 
weakness of will here. 

If we are to get a really compelling counterexample, we need a 
clear case of weakness of will in which a person knows that, if he 
were to form the intention to do what he judges best, he would stick 
to that intention; but he fails to form the intention. Perhaps I am 
blinded by partiality to my account, but I am unable to think of one. 


” London: Gollancz, 1958. 
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(ù) Can the account accommodate disagreement? I said that one plausi- 
ble way of filling out the story of the young Schelling and his blan- 
kets is to say that he displayed weakness of will twice over. But there 
are other things we might say. Knowing Schelling’s subsequent ca- 
reer as an economist rather than an explorer, we might say that the 
real Schelling (the rational realistic Schelling) is the one who 
wanted to keep the blanket on; hence it was only the fanciful 
dreamer who displayed weakness of will when trying to get it off. 
(Contrast Francis Chichester,“ who, true to his real self, stripped his 
childhood bed down to just a sheet, seems never to have been 
tempted to pile on extra blankets, and became a fearless adven- 
turer.) Alternatively, we might say that the weakness of will occurred 
only in the middle of the night, the real Schelling was the one who 
made his decisions in the light of day, not tempted by the lure of the 
warm blanket. We can imagine that debate about this could go on 
for some time. We can imagine the kinds of evidence that would be 
relevant. And we can even imagine that, when all the available evi- 
dence was in, people might still disagree. 

It is easy to see how the traditional account of weakness of will can try 
to explain this disagreement. The debate would be over what Schelling 
really valued: his comfortable sleep or his training as an explorer. Can 
the account offered here explain it as well? I think it can; indeed, I 
think it can actually do better than the traditional account. The ac- 
count offered here employs both a descriptive and a normative ele- 
ment. To display weakness of will, an agent must have formed an 
intention that he revises; that is the descriptive element. And his revi- 
sion must have been something that, by the standards of a good inten- 
der, he should not have done; that is the normative element. 
Disagreements about whether or when agents manifest weakness of will 
typically result from disagreements about the second of these two fac- 
tors. So we might argue that Schelling should have kept to intentions 
that were formed when he could think clearly (such as in the daytime) 
rather than when he could not (such as when he was tired at night). Or 
we might argue that he should have kept to intentions that were realis- 
tic (the intention to get a good night's sleep) rather than romantic and 
unrealistic (the intention to become an Antarctic explorer). 

Here, we have the second way in which the vagueness in the ac- 
count mirrors the vagueness in the concept. I doubt that there will, 
in general, be agreement on when an agent shows weakness of will; 
and I suspect that this disagreement is actually better captured by 


H The Lonely Sea and the Sky (London Hodder and Stoughton, 1964), p 26. 


260 THE JOURNAL OF PHILOSOPHY 


the account on offer here than by the traditional account. On the 
traditional account, the answers will depend on what the agent 
thought best, and that is a purely descriptive question. On the ac- 
count on offer here, the answers will depend on which intentions he 
should have stuck with as a rational intender. That is a normative 
question. I suspect that the deep-rootedness of the debate gives us 
some reason for characterizing it as normative, but I cannot think of 
a way of resolving the issue with any degree of certainty. 

(iti) Does the account work for policy intentions? It might be thought 
that the account cannot accommodate weakness of will with respect 
to policy intentions. Suppose I intend to get up every morning at six. 
Then can I not show weakness of will by lounging in bed today till 
nine, even though I have not revised my policy intention? The right 
answer is that I have revised the intention. I have not abandoned it 
altogether, but I have revised it. I have inserted a let-out clause: I in- 
tend to get up every day at six, except today. 

(tu) Can the account accommodate actions performed without tntenhons? I 
have written as though there is a tight link between intentions and 
actions: as though we do not get actions without intentions. Perhaps 
that connection is too tight. Suppose it is. Then it seems that the fol 
lowing is possible. I form an intention to perform some act, and then 
do something else. But I do not revise my intention; I simply act 
without an intention. Could that not be a case of weakness of will 
that does not fit the account offered here? 

If there were such cases, I doubt that they would pose too much of 
a problem; we could extend the account to embrace certain failures 
to act on intentions as well as revisions of intentions. (Not all such 
failures should be included: I do not typically display weakness of will 
if I forget my intention.) But I am not convinced that there are any 
cases of this kind. Let us consider one of Bratman’s examples of ac- 
tion without intention. Suppose someone throws me a ball and, with- 
out thinking, I just reach out and catch it. That is perhaps an 
intentional act; but it is not clear that I formed an intention to catch 
it. I acted quite without reflection.” But now suppose I form the in- 
tention not to catch the ball; you throw it, and again I just reach out 


= Bratman, Intention, Plans, and Practical Reason, p. 126. He also discusses actions 
that do not result from an intention but from what he calls an “endeavoring” 
(chapter 9). If we follow him in thinking that the distinction is necessary, shall we 
need to augment the present account by substituting ‘intention or endeavoring’ in 
place of each occurrence of ‘intention’? I am not sure, but I think not, since, on 
Bratman’s account, each endeavoring to do something which is not an intention to 
do it will nonetheless be a intention to do some other thing (p. 119), so each over 
hasty revision of an endeavoring will also be an over hasty revision of an intenton. 
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and catch it. Have I shown weakness of will? I rather doubt that I 
have. I suspect that in so far as the act really is one that is done with- 
out an intention, it is one that falls outside my immediate control, 
and so outside of the area in which weakness of will can be shown. I 
might resolve not to faint when the incision is made, not to duck 
when the jets fly low overhead, not to smile when my friend inadver- 
tently makes a clown of himself. If I cannot help myself do these 
things, however, my failure to stick by my resolution does not mani- 
fest weakness of will. Certainly, these are failures of self control. But 
this is just another reason for not equating such failure with weak- 
ness of will. 

(v) Can the account distinguish weakness of will from compulsion? It is a 
commonplace in the philosophical literature that weakness of will 
should be distinguished from compulsion. On the traditional ac- 
count, the distinction is made as follows: one displays weakness of 
will when one freely acts against one’s own best judgment; one is com- 
pelled when one’s action against one’s own best judgment is not 
free. But it might seem that the account offered here can make no 
such distinction. Consider a compulsive person—a kleptomaniac, for 
instance—who intends never to steal again, but who revises that in- 
tention and does. Will his revision not be one that he rationally 
should not have made? Yet if so, he will count as weak willed.” 

I have two things to say about this. First, it is not obvious to me 
that it would be a disaster if the account did classify compulsive acts 
as weak willed; for it is not obvious to me that they are not. Certainly, 
we would not normally say that a compulsive person was weak willed, 
but that could be because it would be inappropriate rather than false 
to say so—in the way that it would be inappropriate to say that sadis- 
tic torturers are unkind. Are we really averse to saying that comput 
sives are pathologically weak willed? 

Second, it is not obvious whether the account offered here would 
classify compulsives as weak willed. Certainly, one can imagine cases 
for whom the compulsive action is so automatic that there is no need 
of an intention, and hence no need to revise any contrary intention: 
for the kleptomaniac who simply finds himself placing objects in his 


= See Watson; Kennett and Michael Smith, “Philosophy and Commonsense: 
The Case of Weakness of Will,” in Michaelis Michael and John O’Leary-Hawthorn, 
eds., Philosophy in Mind (Dordrecht Kluwer, 1994), pp. 141-57. Watson argues that 
the relevant notion of a free action is one which the agent could have performed 
had they developed in themselves the kinds of skills and capacities that we expect 
them to develop. 

r Thanks to Smith and Linda Barclay for making me think about this issue. 
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pocket, intentions to steal or not to steal are quite beside the point. 
So such cases will not be classed as weak willed. But what of the klep- 
tomaniac who consciously but compulsively does revise his intention 
not to steal; that is, who is simply unable to refrain from making the 
revision? Will he count as someone who has an unreasonable ten- 
dency to revise, and hence as someone who is weak willed? This de- 
pends, of course, on how we determine what is reasonable. We could 
use internal criteria, criteria that always restrict what it is reasonable 
for an agent to do to that which it is in the agent’s power to do.™ 
Such criteria would not classify the kleptomaniac’s revision as unrea- 
sonable, and hence would not classify him as weak willed. Alterna- 
tively, we could use external criteria, criteria that do not restrict what 
it is reasonable for an agent to do to that which it is in the agent’s 
power to do. Such criteria probably would class the kleptomaniac’s 
revision as unreasonable, and hence probably would classify him as 
weak willed. I have not come down in favor of one or the other of 
these criteria of what is reasonable; indeed, I suspect that while Iam 
trying to give an account of our ordinary notion of weakness of will, I 
should remain uncommitted. For I suspect that this is another place 
where our ordinary concept is vague. 

So the present account of weakness of will has difficulties with 
compulsion only if two further claims are met: if it is right to use ex- 
ternal criteria of what is reasonable, and if the compulsive should 
certainly not count as weak willed. I have questioned both of these 
claims; but even if they were true, a simple amendment to the ac- 
count would fix things up. Rather than say that weakness of will con- 
sists in overreadily revising an intention, say that it consists in 
overreadily revising an intention when i is tn the agent’s power to desist 
from that revision. 

RICHARD HOLTON 
University of Sheffield 


™ In the agent's power in some broad sense; we could follow Watson in thinking 
that what is important 1s what would have been in the agent’s power if the agent 
had developed the capacities that we normally expect them to develop. 
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Open Minded: Working Out the Logic of the Soul JONATHAN LEAR. Cam- 
bridge: Harvard University Press, 1998. 342 p. Cloth $35.00. 


A mind that is not open is closed; it is a closed mind. So the differ- 
ence between them cannot be that the one does, while the other 
does not, entertain ideas. Nor can it lie in the heat of the entertain- 
ing, since open-mindedness is consistent with passionate conviction; 
it depends on how conviction is achieved, on whether it is sustained 
by self-deception. Perhaps, then, only one who knows his own mind 
can have a mind that is open. 

In the preface to his provocative, often brilliant book, Jonathan 
Lear writes: “We lose sight of Plato’s project, laid out so beautifully in 
the Republic, of giving a nonformal yet not nonempirical account of 
what it is to be human. Plato, one might say, is working out [in the Re- 
public] the very idea of what it is to be minded as we are. And he does 
this in the light of Socrates’ exemplification—a life spent showing 
us—that one of the most important truths about us is that we have 
the capacity to be open-minded the capacity to live nondefensively with 
the question of how to live” (8). It is a story that is best continued 
later by Sigmund Freud, for it is he who teaches us about the mind’s 
strategies of defense, about the varieties of psychic pain and psychic 
trauma, and about how our defenses against them may sometimes be 
more damaging to the mind than the enemy it tries to keep out. 

Plato and Freud are the two intellectual heroes of this book, the 
twelve essays of which together yield a vision that is Freudian in the 
best sense. The dark side of the vision is that the mind is not so ræ 
tional—in that sense not so open—as we like to think; that the divi- 
sion between the known and the unknown comes not where mind 
edges world, but within the mind itself; and that defensively to deny 
reason’s limits invites both personal and social disaster. The lighter 
side is that resistances to reason are not always final, and that a 
greater degree of psychic wholeness, which the mind naturally wants, 
can be gained precisely by working with and through resistance. 

The lead essay titled “On Killing Freud (Again),” which first ap- 
peared in The New Republic as a response to the Library of Congress's 
decision to cancel the Freud exhibit, sets the tone. Like most psycho- 
analysts, Lear interprets the Oedipal complex as having to do with 
childhood phantasies of omnipotence. The complex is “resolved” in 
relinquishing these phantasies, so acknowledging the limits of both 


0022-362X/99/9605 /263~-69 © 1999 The Journal of Philosophy, Inc. 


264 THE JOURNAL OF PHILOSOPHY 


one's power and one’s knowledge. This is the sense in which, on 
Lear’s reading, the drama of Sophocles’ play is Oedipal. Oedipus 
manically insists that he is a self-made man, sufficient unto himself, 
that prophecy is blind, reason is all. Lear calls these pretenmons 
knowingness, a defensive attitude motivated by anxieties about one : 
limits. Whether we call the unknown “Fate” or “the unconscious,” 
tragic consciousness in any culture consists in the recognition that 
human beings live their lives, as Lear puts it, “in the intersection of 
two realms of meaning” (53). He grants that the Freud-bashers have 
many legitimate complaints, but claims one can dismiss Freud out of 
hand only by denying that we are often moved by forces within us 
which we do not understand. The Freud-bashers exhibit a contem- 
porary form of knowingness. 

The companion essay that follows, “Knowingness and Abandon- 
ment: An Oedipus for Our Time,” develops these ideas into a cri 
tique of postmodernist thinking. The Enlightenment “constituted 
itself with a manic, oedipal defense: even though the gods have left 
us, human reason can take their place” (54). But since its function is 
to defend the mind against something awful that it half recognizes to 
be the case, such a defense is inherently unstable. Postmodern con- 
sciousness, Lear suggests, originates in the collapse of this defense. 
The idea is not that the postmodernist has worked through his 
“Oedipal” problems about knowing, rather that one of the criteria 
for a “successful” analysis, as it were, is the ability to recognize and 
face the problems as they continually reappear. The psychoanalytic 
aim is not to get rid of all defenses, but only those which are inflexi 
ble, outdated, and, on the whole, too costly. 

In “Restlessness, Phantasy, and the Concept of Mind,” Lear argues 
that philosophical tradition in the West has often held a view of the 
mind skewed toward reason in ways Freud tried to correct. Lear’s strat- 
egy is first to show that a familiar contemporary account of irrationak 
ity, which I shall call the rationalizing account, is unsatisfactory, then to 
show, in giving a better account of it, that what we call irrationality is 
not a knot at the mind’s brim, but a lead into its very center. 

On the rationalizing account, as Lear presents it, irrationality re- 
quires us to posit a mind subdivided into structures of propositional 
attitudes, each of which is rationally coherent within itself. “On this 
schema, the unconscious mind would be conceived of as its own lo- 
cus of rationality (or quasi-rationality)...” (82). The account holds 
that in this way we can allow for that degree of holism and rationality 
which the concept of the mental presumes, while making room for 
phenomena like self-deception and akrasta, The move that makes 
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for division, the account continues, is a mental cause, namely, the 
wish to ignore the judgment of reason in a particular case, that is not 
a reason (a good reason) for what it causes. Lear clearly has Donald 
Davidson in mind in all this, though he mentions him by name only 
in the notes. 

Lear thinks the rationalizing account unsatisfactory because it 
leaves unexplained the fact of irrationality itself, that is, why in any 
particular case the better reason was not operative. It teaches us 
nothing about how the irrationality comes into being. Its fundamen- 
tal error is the failure to see that restlessness, as Lear puts it, is funda- 
mental to the mind, an idea he thinks was one of Freud’s distinctive 
achievements. In the Freudian view, restlessness expresses itself in 
mental tropisms, like projective and introjective phantasies, through 
which the mind attempts to rid itself of anxiety; the drives (of which 
sexuality is the paradigm); and primaryprocess mental activity. (Lear 
uses the ph spelling of ‘phantasy’ to differentiate it from the ordi- 
nary concept of fantasy as imagining or daydreaming.) All these con- 
cepts are part of the basic Freudian repertoire, though Lear's 
interpretation of them is influenced both by Melanie Klein and 
Hans Loewald (about whom I say more in a moment). 

Here is my own rough explication of projection. Let us say that a 
child is furious with someone on whom he is vitally dependent. Since 
he may then be at some level aware of his own anger as endangering 
this relationship, he becomes anxious. The mind may then handle 
this anxiety by constructing an affective picture or an affective (non- 
propositional) thought of the anger as in another person, or as part 
of another person. The anger is cast out of one’s own mind, so to 
speak, and projected onto, or (in one’s imagination) into, the other. 

Lear does not want us to think of this manoeuver as fully inten- 
tional, but rather as instancing a mental tropism, which he defines as 
“a subintentional, nonrational mental activity which has a strategic 
point: namely, relieving anxiety” (118). Phantasy disrupts rational 
thought processes in a way which is purposive but which cannot be 
accommodated to the model of akrasia or self-deception. (This idea 
was first articulated by Richard Wollheim in The Thread of Life, and 
picked up both by James Hopkins and myself.’ Phantasies must be 
seen, Lear argues, not only as contents in the mind, but also as acts 
upon the mind—for example, dividing a mind into parts, or keeping it 
from achieving the sort of integration that rationality presumes. The 


’Wollheim, The Thread of Life (New York: Cambridge, 1984); Hopkins, Introduc- 
ton to Philosophical Essays on Freud (New York. Cambridge, 1982); and my The Psy- 
choanalyic Mind: From Freud to Philosophy (Cambridge Harvard, 1993). 
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idea of division is common to both Lear and the rationalizing ac- 
count. But Lear envisions the act of dividing and the divided por- 
tions themselves differently. The concept of phantasy helps 
articulate these differences. 

In commenting on this, I now want to go back to the rationalizing 
account. | quoted Lear as saying that it treats irrationality “as a by- 
product of the mind’s being a composite of two quastminds.” Lear 
apparently sees division on the rationalizing account as just a fact 
about the mind, from which irrationality somehow ensues. This mis- 
takes Davidson’s argument in an important way: irrationality on his 
view is not a byproduct of division, but the very phenomenon that 
requires us, as philosophical analysts, to posit division. Davidson 
grants, furthermore, that there are many phenomena that are nei- 
ther rational nor irrational, but arational and mental (perhaps phan- 
tasy belongs here). Finally, he is explicit in saying that his account is 
mute about the kind of psychological detail Lear is after. It is mute 
on the descriptive, psychological question of what goes on in the 
mind when it is in the proceas of dividing, and how it looks before 
and after the division. Lear correctly sees that, by Davidson’s own ad- 
mission, the rationalizing account is highly abstract. 

Davidson also explicitly rejects the idea Lear imputes to him that 
the divisions in irrationality should be described as conscious versus 
unconscious. Neither division is entirely rational in the way Lear sug- 
gests. Instead, in Davidsonian division there is room tm both parts of 
the divided mind for contents that are not the sort of thing that are 
reasons in any case, and for connections between ideas that are nei- 
ther rational nor irrational. Lear comments that with “so much ra- 
tionality seemingly built into the very idea of mind, it’s a wonder we 
can ever take an irrational breath” (83). Taking a breath is not some- 
thing we usually do either for or in spite of reason. It is usually net 
ther irrational nor rational. In urging that philosophy of mind needs 
more distinctions than philosophers have given it, Lear overlooks 
some it has. 

But let us suppose that Lear’s description of what is missing in the 
rationalizing account is more or less right. Then we might ask the 
following questions. Is the rationalizing account compatible with 
Lear’s description? Even if it is, does this description nevertheless 
suggest a picture of the mind that is substantially different from that 
of the rationalizing account? How can Lear’s picture accommodate 
the larger considerations about meaning, language, and communi- 
cation which he does not address? And is it the case, as Lear claims 
but does not really argue, that the sources of irrationality feed also 
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those forms of creativity without which there would be no mind at all 
of the sort we value? 

These are hard questions I do not know how to answer. This much 
seems Clear: the difference between Lear’s position and that of the 
rationalizing account is at least a matter of temperament; and, like 
all such differences, it leads to differences in emphasis, even in per- 
ception. Like Lear, I believe that the psychoanalytic data force one 
to a concept of psychic intelligibility that escapes the concept of ræ 
tionality as we often think of it. Whether his views are right in the de- 
tail, Lear illumines this area of ambiguity between the rational, with 
its variant in irrationality, and the arational. 

Only two of the essays in the book have to do with subjects that are 
strictly psychoanalytic. The first is an interpretation of the psychoan- 
alytic concept of transference. Readers of Freud usually understand 
it to refer to a specific kind of thought process: the carrying over 
onto a present interpersonal relationship of those perceptions, feel 
ings, and behavior which had their origin in a past relationship and 
which tend to stamp the present with a mold that is rigidly familiar 
and deeply conflicted. Lear discusses transference in this sense. But 
often he speaks of it as if it were more or less synonymous with 
thought in general. He says that “transference...is just the psycholog+ 
cal characteristic activity of creating a meaningful world in which to 
live” (60); and that as it emerges in analysis, transference “is simply 
the idiosyncratic, unconscious side of the psyche’s fundamental ac- 
tivity: to inform the world with meaning” (70-71). The key to Lear’s 
interpretation is in the work of Loewald, a remarkable American psy- 
choanalyst to whom Lear acknowledges an important debt. For 
philosophers interested in the turns taken in recent psychoanalytic 
thinking by the concepts of instinct, Eros, Thanatos, and the repeti- 
tion compulsion, Lear’s other “psychoanalytic” essay, “The Introduc- 
tion of Eros: Reflections on the Work of Hans Loewald,” should be 
invaluable. 

My favorite essay is the one titled “Eros and Unknowing: The Psy- 
choanalytic Significance of Plato’s Sympostum.” On Lear’s reading, 
the dialogue begins as a comedy, and ends as a tragedy, with Alcibi 
ades simply unable to transcend his particularity, his bodily nature, 
as Socrates, through Diotima, wants him to do. Alcibiades, longing 
and forlorn, the very embodiment of Aristophanes’ tragicomic fable 
about our cloven ancestors, is left behind in the gap between here 
and there. Lear writes: “By now it should be clear that the psychoan- 
alytic approach to the human-erotic differs fundamentally from the 
Socratic. For one thing, psychoanalytic ‘ascent’ is toward individua- 
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tion rather than away from it” (165). And the body, as it figures, for 
example, in phantasy, enhances individuation. Eros was a divine 
source of ascent for Socrates, but only a distraction to Alcibiades. 
“From a psychoanalytic perspective, by contrast, this erotic distrac- 
tion needs to be worked with and worked through [to be seen asa 
‘transference’ insistence that another conform to set roles] if human 
life is to be afforded a comic restoration” (165-66). This is a possibil- 
ity which Lear believes Plato, but not Socrates, saw. 

A comparison between psychotherapy and philosophy easily sug- 
gests itself to philosophers, but it has never been developed as deftly 
as it is in these essays. In “Transcendental Anthropology,” Lear works 
out a subtle and complex argument to which I cannot do justice 
here, an argument that culminates in an analogy between the 
Wittgensteinian philosopher, examining “our” form of life, and the 
psychoanalytic patient’s process of self-discovery. The analogy takes 
off from a discussion of the difference in Immanuel Kant between 
the pure apperception of consciousness and the ‘I think’: the first 
must be able to accompany all representations, it informs us about 
the ‘I’ that is thinking, but, unlike the ‘I think’, it cannot itself be 
represented or observed. Similarly, the Wittgensteinian philosopher 
learns that “‘observing’ the context in which rule-following activity is 
embedded will not yield the full meaning of that activity...” (273). 
Such a philosopher is trying to gain “nonobservational” insight into 
something—he calls it a form of life—that is too intimate a part of 
our observations to be itself observed. Similarly again, although the 
analysand learns to become a better observer of himself, the point of 
therapy “is not the observation of an accurately represented person, 
but nonobservational insight into the person creating the represen- 
tation” (272). 

This raises an interesting question about just how we should un- 
derstand the psychoanalytic concept of the unconscious. Lear does 
not take this up, but looks instead in this essay toward philosophy. 
He suggests that the familiar complaint that psychoanalysis is not an 
empirical discipline may be a reason for hope, not disdain; “for per- 
haps reflection on the therapeutic model will shed light on how to 
proceed philosophically in a way which is neither a methodology of 
the sciences nor a purely transcendental investigation” (272). 

I have one large complaint about this book, and that is its promis- 
cuous use of the word ‘meaning’. It is an idea at the heart of the es- 
says on transference and irrationality. Yet passages like the following, 
about dreams and dream interpretation, are about as close to an 
analysis of meaning itself as Lear ever comes: “How could mind be 
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making a meaning it doesn’t understand? To be making a meaning, 
it must be making certain associations among ideas, engaging in 
symbolization, however elementary” (85). And what is this process of 
“making a meaning,” a phrase Lear uses more than once? Is it, in 
particular, as Lear suggests, something one does all by oneself? Such 
promiscuity is particularly noticeable in a book that invokes Ludwig 
Wittgenstein at the beginning (the preface speaks briefly of Wittgen- 
stein’s having dispelled “the myth of meaning”) and devotes to him 
its two concluding essays. 

But perhaps the best way to read Open Minded is by putting your- 
self in its hands. Then you will have the intense pleasure of travers- 
ing a terrain you thought you knew by a new road, from which, 
rounding a bend, you have again and again a stunning glimpse of a 
familiar yet usually hidden landscape. In the Sophoclean universe, 
Lear remarks, there is either a rigid reliance on human reason, or 
submission to a divine realm. “In neither position is there room for 
philosophy, that peculiarly thoughtful response to awe” (51). Open 
Mtnded is itself such a philosophical response. 

MARCIA CAVELL 
Berkeley, California 
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n recent decades, epistemology has witnessed the development 
[== growth of externalist theories of knowledge and 

justification.’ Critics of externalism have focused a bright spot- 
light on this approach and judged it unsuitable for realizing the true 
and original goals of epistemology. Their own favored approach, in- 
ternalism, is defended as a preferable approach to the traditional 
concept of epistemic justification.” I shall turn the spotlight toward 
internalism and its most prominent rationale, revealing fundamental 
problems at the core of internalism and challenging the viability of 
its most popular rationale. Although particular internalist theories 
such as (internalist) foundationalism and coherentism will occasion- 
ally be discussed, those specific theories are not my primary concern. 


* An earlier vermon of this paper was presented in Pittsburgh, at the Central Dr 
vision meeting of the Amencan Philosophical Association, April 25, 1997. My com- 
mentator on that occasion was Matthias Steup, and I am much indebted to him for 
valuable correspondence on this topic. I am also grateful to Tim Bayne and Holly 
Smith for very useful suggestions. 

’ Prominent statements of externalism include D. M. Armstrong, Bebef Truth 
and Knowledge (New York: Cambndgr, 1973); Fred Dretske, Knowledge and the Flow 
of Information (Cambridge: MIT, 1981), Robert Nozick, Philosophical Explanations 
(Cambridge: Harvard, 1981); my Epistemology and Cogniton (Cambridge: Harvard, 
1986); and Alvin Planunga, Warrant and Function (New York: Oxford, 1993). 

* Major statements of internalism include Roderick Chisholm, Theory of Know- 

(Englewood Cliffs, NJ: Prentice-Hall, 1966, 1st edition; 1977, 2nd edition; 
1989, 3rd edition); Laurence BonJour, The Structure of Empirical Knowledge (Cam- 
bridge: Harvard, 1985); John Pollock, Contemporary Theories of Knowledge (Totowa, 
NJ: Rowman and Luattlefield, 1986); Richard Foley, The Theory of Hptstemic Rationality 
(Cambridge: Harvard, 1987); and Keith Lehrer, Theory of Knowledge (Boulder: West- 
view, 1990). In addition to relatively pure versions of externalism and internalism, 
there are also mixtures of the two approaches, as found in Willam Alston, Epistemic 
Jushficahon (Ithaca: Cornell, 1989); Ernest Sosa, Knowledge tn Perspectrue (New York: 
Cambridge, 1991); and Robert Audi, The Structure of Justification (New York: Cam- 
bridge, 1998) 
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The principal concern is rather the general architecture of internal 
ism, and the attempt to justify this architecture by appeal to a certain 
conception of what justification consists in. 

L DEONTOLOGY, ACCESS, AND INTERNALISM 
I begin with a certain rationale for internalism that has widespread 
support. It can be reconstructed in three steps: 


(1) The gusdancedeontological (GD) conception of justification is posited. 

(2) A certain constraint on the determiners of justification is derived 
from the GD conception, that is, the constraint that all justification 
determiners must be accessible to, or knowable by, the epistemic agent. 

(3) The accessibility or knowability constraint is taken to imply that 
only internal conditions qualify as legitimate determiners of justi 
fication. So justification must be a purely internal affair.’ 


What motivates or underlies this rationale for internalism? Histori- 
cally, one central aim of epistemology is to guide or direct our intel- 
lectual conduct, an aim expressed in René Descartes’s title, “Rules 
for the Direction of the Mind."* Among contemporary writers, John 
Pollock expresses the idea this way: 


I have taken the fundamental problem of epistemology to be that of de- 
ciding what to believe. Epistemic justification, as I use the term, is com 
cerned with this problem. Considerations of epistemic justification 
guide us in determining what to believe. We might call this the “belief- 
guiding” or “reason-guiding” sense of ‘justification’ (op. at, p. 10). 
The guidance conception of justification is commonly paired with 
the deontological conception of justification. John Locke’ wrote of a 
person’s “duty as a rational creature” (ibid, p. 413), and the theme 
of epistemic duty or responsibility has been echoed by many contem- 


> Plantinga also traces internalism to the deontological conception: “If we go 
back to the source of the internalist tradition,...we can see that internalum arises 
out of deontology; a deontological conception of warrant...leads directly to inter- 
nalism” (op ai, pp. 24-25). Alston proposes a tly different rationale for inter- 
nalism, although his rationale also proceeds via the knowability constraint (op. at, 
p. 236). He suggests that the concept of justification derives from the interpersonal 
practice of criticizing one another’s beliefs and asking for their credentials A per- 
son can appropriately respond to other people’s demands for credentials only if he 
knows what those credentials are. So it is quite understandable, says Alston, that 
justifiers must meet the requirement of being accesible to the agent. Clearly, this 
is one way to derive the accessibility constraint without appeal to the deontological 
conception. But Alston 1s the only one I know of who advances this ground for the 
accessibility constraint In any case, most of the problems I shall identify pertain to 
the accessibility constraint itself, which Alston's rationale shares with the deonto- 
logical rationale. 

: ical Works of Descartes, Volume I, Ehrabeth Haldane and G. R. T. Ross, 
trans. (New York: Dover, 1955) 

' An Essay Concerning Human Understanding, Volume I, A. C Fraser, ed. (New 
York: Dover, 1955). 
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porary epistemologists, including Laurence BonJour (op. at), Roder- 
ick Chisholm (op. cit.), Carl Ginet, Paul Moser, Matthias Steup, 
Richard Feldman, and Hilary Kornblith.® Chisholm defines cousins 
of the concept of justification in terms of the relation ‘more reason- 
able than’, and he re-expresses the relation ‘p is more reasonable 
than q for S at t’ by saying: “S is so situated at t that his intellectual rø- 
quirement, his responsibility as an intellectual being, is better fulfilled 
by p than by q.”’ Similarly, Feldman says that one’s epistemic duty is 
to “believe what is supported or justified by one’s evidence and to 
avoid believing what is not supported by one’s evidence” (op. at, p. 
254). 

The guidance and deontological conceptions of justification are inti- 
mately related, because the deontological conception, at least when 
paired with the guidance conception, considers it a person’s epistemic 
duty to guide his doxastic attitudes by his evidence, or by whatever fac- 
tors determine the justificational status of a proposition at a given time. 
Epistemic deontologists commonly maintain that being justified in be- 
lieving a proposition p consists in being (intellectually) required or per- 
mitted to believe p; and being unjustified in believing p consists in not 
being permitted, or being forbidden, to believe p. When a person is un- 
justified in believing a proposition, it is his duty not to believe it. 

It is possible to separate the deontological conception from the 
guidance idea. In ethical theory, a distinction has been drawn between 
accounts of moral duty that aim to specify what makes actions nght 
and accounts of moral duty that aim to provide practical decision pro- 
cedures for what to do.’ If an account simply aims at the first desidera- 
tum, it need not aspire to be usable as a decision guide. Similarly, 
accounts of epistemic duty need not necessarily be intended as deci 
sion guides. When the deontological conception is used as a rationale 
for epistemic internalism of the sort I am sketching, however, it does 
incorporate the guidance conception. Only if the guidance concep- 


* Ginet, Knowledge, Perception, and Memory (Dordrecht Reidel, 1975); Moser, 
Empirical (Dordrecht Reidel, 1985); Steup, “The Deontcc Conception 
of Episternic Justification,” Philosophical Studses, Ln (1988): 65-84; Feldman, “Epis 
temic Obligations,” in J. Tomberlin, ed., Philosophical Perspectrues, Volume II (Atas- 
cadero, CA: Ridgeview, 1988), pp. 235-56; and Kornblith, “Justified Belief and 
Epustemcally Responsible Action,” Philosophical Review, Xcu (1983): 33—48 

of 2nd edition, p. 14 (emphasis added). 

* For example, R. Eugene Bales distinguishes between two possible arns of act- 
utilitarianism: as a specifier of a right-making charactenstic or as a deamon-making 
procedure. See “Act-utulitarianism: Account of Right-making Characteristics or De- 
clsion-making Procedure,” Amencan Philosophical Quarterly, vi (1971). 257-65. He 
defends utilitananism against certain cntics by saying that it does not kave to per- 
form the latter function. 
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tion is incorporated can the argument proceed along the intended 
lines to the acceasibility constraint, and from there to internalism. This 
is why I shall henceforth speak of the GD conception of justification. 

I turn now to the second step of the argument for internalism. Fob 
lowing William Alston,’ I shall use the term justifers for facts or states 
of affairs that determine the justificational status of a belief, or the 
epistemic status a proposition has for an epistemic agent. In other 
words, justifiers determine whether or not a proposition is justified 
for an epistemic agent at a given time. It seems to follow naturally 
from the GD conception of justification that a certain constraint must 
be placed on the sorts of facts or states of affairs that qualify as justt 
fiers. If a person is going to avoid violating his epistemic duty, he 
must know, or be able to find out, what his duty requires. By know, in 
this context, I mean only: have an accurate, or true, belief. I do not 
mean: have a justified true belief (or whatever else is entailed by the 
richer concept of knowledge). Admittedly, it might be possible to 
avoid violating one’s duties by chance, without knowing (having true 
beliefs about) what one’s duties are. As a practical matter, however, it 
is not feasible to conform to duty on a regular and consistent basis 
without knowing what items of conduct constitute those duties. Thus, 
if you are going to choose your beliefs and abstentions from belief in 
accordance with your justificational requirements, the facts that make 
you justified or unjustified in believing a certain proposition at a 
given time must be facts that you are capable of knowing, at that 
time, to hold or not to hold. There is an intimate connection, then, 
between the GD conception of justification and the requirement that 
justifiers must be accessible to, or knowable by, the agent at the time 
of belief. If you cannot accurately ascertain your epistemic duty at a 
given time, how can you be expected to execute that duty, and how 
can you reasonably be held responsible for executing that duty??? 

The knowability constraint on fustifiers which flows from the GD 
conception may be formulated as follows: 


KJ: The only facts that qualify as justifiers of an agent's believing p at 
time fare facts that the agent can readily know, at £ to obtain or 
not to obtain. 


* “Internalism and Externalism in Epistemology,” reprinted in his of at, pp. 
185-226, here p. 189. 

* Some internahsts explicitly reject externalism on the grounds that it cannot 
be used as a decision guide. For example, Pollock says: “[T]t is in principle impoasi- 
ble for us to actually employ externalist norma. I take this to be a conclusive refuta- 
ton of belief erter" (op at, p. ce He would not subscribe to the full 
argument for internalism I am discussing, however, because it is committed to the 

“intellecturalist model” of epistemology, which he disparages. 
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How can an agent readily know whether candidate justifiers obtain 
or do not obtain? Presumably, the agent must have a way of deter- 
mining, for any candidate class of justifiers, whether or not they ob- 
tain. Such a way of knowing must be reliable, that is, it must 
generate beliefs about the presence or absence of justifiers that are 
usually (invariably?) correct. Otherwise, the agent will often be mis- 
taken about what his epistemic duty requires. The way of knowing 
must also be “powerful,” in the sense that when justifiers obtain it is 
likely (certain?) that the agent will believe that they obtain; at least 
he will believe this if he reflects on the matter or otherwise inquires 
into it.’ As we shall soon see, internalists typically impose additional 
restrictions on how justifiers may be known. But the minimal, 
generic version of KJ simply requires justifiers to be the sorts of 
facts that agents have some way of knowing. In other words, justifica- 
tion-conferring facts must be the sorts of facts whose presence or 
absence is “accessible” to agents.’ 

Given the KJ constraint on justifiers, it becomes fairly obvious why 
internalism about justification is so attractive. Whereas external facts 
are facts that a cognitive agent might not be in a position to know 
about, internal facts are presumably the sorts of conditions that a 
cognitive agent can readily determine. So internal facts seem to be 
the right sorts of candidates for justifiers. This consideration leads to 
the third step of our rationale for internalism. Only internal facts 
qualify as justifiers because they are the only ones that satisfy the KJ 
constraint; at least so internalists suppose. 

One possible way to criticize this rationale for internalism is to 
challenge the GD conception directly. This could be done, for ex- 
ample, by arguing that the GD conception of justification presup- 
poses the dubious thesis of doxastic voluntarism, the thesis that 
doxastic attitudes can be “guided” by deliberate choices or acts of 
will. This criticism is developed by Alston,’ and I have sympathy 
with many of his points. But the voluntarism argument against 
the GD conception is disputed by Feldman (op. cit.) and John 


" For the disonction between rehability and power (phrased slghtly differ- 
ently), see my op. cit, chapter 6 

n Jack Lyons points out that to comply with one’s epistemic duty it suffices to 
know that one has (undefeated) fustfiers for proposition p; one does not have to 
know wich justfiers these are So the argument is not enttled to conclude that 
knowledge of particular jusnfiers is required by epistemic duty. Practically speak- 
ing, however, it is difficult to see how a cognitive agent could know that relevant 
jusifiers exist without knowing which particular ones exist So J shall pass over this 
objection to the internalist line of argument. 

> “The Deontological Conception of Justification,” reprinted in his op. at, pp. 
115-52. 
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Heil,’* among others. Feldman, for example, argues that epistemic 
deontologism is not wedded to the assumption of doxastic volun- 
tarism. Many obligations remain in force, he points out, even 
when an agent lacks the ability to discharge them. A person is still 
legally obligated to repay a debt even when his financial situation 
makes him unable to repay it. Perhaps epistemic obligations have 
analogous properties.” Since the complex topic of doxastic volun- 
tarism would require article-length treatment in its own right, J set 
this issue aside and confine my attention to other issues. Although 
I do not accept the GD conception of justification, I take it as 
given for purposes of the present discussion and explore where it 
leads. In any case, what is ultimately crucial for internalism is the 
accessibility requirement that the GD conception hopes to ration 
alize. Even if the GD conception fails to provide a good rationale, 
internalism would be viable if some other rationale could be pro- 
vided for a suitable accessibility requirement. 
IL DIRECT KNOWABILITY AND STRONG INTERNALISM 

The initial KJ constraint was formulated in terms of knowability plain 
and simple, but proponents of internalism often add the further quab 
ification that determinants of justification must be directly knowable 
by the cognitive agent. Ginet, for example, writes as follows: 


Every one of every set of facts about S’s position that minimally suffices 
to make S, at a given time, justified in being confident that p must be d+ 


rectly recognizable to S at that time (op. cit., p. 34). 
Similarly, Chisholm writes: 


[T]he concept of epistemic justification is...internal and immediate in 
that one can find out derecth, by reflection, what one is justified in beliew 
ing at any time."® 


Thus, Ginet and Chisholm do not endorse just the minimal KJ con- 
straint as earlier formulated, but a more restrictive version, which 
might be written as follows: 


4 “Doxastic Agency,” Philosophical Siudses, xL (1988): 355-64 

* Feldman’s response, however, undercuts the step from the GD conception of 
justification to the knowability constraint. If epistemic duty does not require that 
the agent be able to discharge this duty, there is no longer a rationale for the 
knowability constraint A different line of response to the voluntanam worry u 
taken by Lehrer, who suggests that epistemological analyms should focus not on be- 
lief but on acceptance, where acceptance us some sort of action that is subject to the 
will-—"A SelfProfile,” in R. Bogdan, ed., Kath Lehrer (Dordrecht: Reidel, 1981), pp. 
3-104. 

'* Theory of Knowledge, 3rd edition, p. 7, emphasis added and original emphass 
deleted 
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KJa.: The only facts that qualify as justifiers of an agent's believing p 
at Ome tare facts that the agent can readily know derecth, at t, to 
obtain or not to obtain 


An initial problem arising from KJ,,, is this: What warrants the impo- 
sition of K],, as opposed to the looser constraint, KJ? KJ was derived 
from the GD conception on the grounds that one cannot reasonably 
be expected to comply with epistemic duties unless one knows what 
those duties are. How does such an argument warrant the further con- 
clusion that direct knowledge of justification must be available? Even 
indirect knowledge (whatever that is) would enable an agent to com- 
ply with his epistemic duties. So the second step of the argument for 
internalism cannot properly be revised to feature KJ,, in place of KJ. 
Proponents of KJ,,, might reply that direct forms of knowledge are 
more powerful than indirect knowledge, but this reply is unconvinc- 
ing. The power requirement was already built into the original version 
of KJ, and it is unclear how directness adds anything of significance on 
that score. Whether KJa can be derived from GD is a serious problem, 
because the argument for internalism rests on something like the dt 
rectness qualification. I shall say more about this later; for now I set 
this point aside in order to explore where KJa leads. 

What modes of knowledge count as direct? At least one form of di 
rect knowledge is introspection. A reason for thinking that intro- 
spection is what Chisholm means by direct knowledge is that he 
restricts all determiners of justification to conscious states: | 


A consequence of our “internalistic” theory of knowledge is that, if one 
is subject to an epistemic requirement at any time, then this require- 
ment is imposed by the conscious state in which one happens to find one- 
self at that time (bid., pp. 59-60). 


Since he restricts justifiers to conscious states, it is plausible to as- 
sume that direct knowledge, for Chisholm, means introspective 
knowledge, and knowledge by “reflection” coincides with knowledge 
by introspection.’ At least in the case of Chisholm, then, KJ,,, might 
be replaced by: 


KJ: The only facts that qualify as justifiers of an agent's believing p at 
time ¢ are facts that the agent can readily know by tntrospechon, at t, 
to obtain or not to obtain. 


Other epistemologists who restrict justifiers to conscious states or discuss 
access in terms of introspection include Moser, p. 174; Feldman, “Having 
Evidence,” in D. Austin, ed, Philosophical Anaijas (Dordrecht: Kluwer, 1988), pp. 
83-104; and Audi, “Causalist Internalism,” Amencan Philosophical Quarterly, XXVI, 4 
(1989): 309-20. 


° hee 
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Now, the only facts that an agent can know by introspection are facts 
concerning what conscious states he is (or is not) currently in, so 
these are the only sorts of facts that qualify as justifiers under KJ... 
This form of internalism may be called strong internalism: 


SI: Only facts concerning what conscious states an agent is in at time ¢ 
are justifiers of the agent’s beliefs at 4 


Strong internalism, however, is an unacceptable approach to justifi- 
cation, for it has serious, skepticism-breeding, consequences. This is 
demonstrated by the problem of stored beliefs. At any given time, the 
vast majority of one’s beliefs are stored in memory rather than occur- 
rent or active. Beliefs about personal data (for example, one’s social 
security number), about world history, about geography, or about 
the institutional affiliations of one’s professional colleagues, are al- 
most all stored rather than occurrent at a given moment. Further- 
more, for almost any of these beliefs, one’s conscious state at the 
time includes nothing that justifies it. No perceptual experience, no 
conscious memory event, and no premises consciously entertained at 
the selected moment will be justificationally sufficient for such a be- 
lief. According to strong internalism, then, none of these beliefs is 
justified at that moment. Strong internalism threatens a drastic 
diminution in the stock of beliefs ordinarily deemed justified, and 
hence in the stock of knowledge, assuming that justification is neces- 
sary for knowledge. This is a major count against this type of theory. 

Feldman anticipates this problem because his own account of hav- 
ing evidence also implies that only consciously entertained factors 
have evidential force (op. at, pp. 98-99). Feldman tries to meet the 
threat by distinguishing between occurrent and dispositional senses 
of epistemic terms. (He actually discusses knowledge rather than jus- 
tification, but I shall address the issue in terms of justification be- 
cause that is the target of our investigation.) Feldman is not simply 
restating the familiar point that ‘belief has occurrent and disposi- 
tional senses. He is proposing that the term ‘justified’ is ambiguous 
between an occurrent and a dispositional sense. Feldman apparently 
claims that in the case of stored beliefs, people at most have dispost 
tional justification, not occurrent justification. 

There are two problems with this proposal. First, if having a dispo- 
sition to generate conscious evidential states qualifies as a justifier of 
a belief, why would this not extend fom memorial to perceptual dis- 
positions? Suppose a train passenger awakes from a nap but has not 
yet opened his eyes. Is he justified in believing propositions about 
the details of the neighboring landscape? Surely not. Yet he is dis- 
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posed, merely by opening his eyes, to generate conscious evidential 
states that would occurrently justify such beliefs. So the dispositional 
approach is far too permissive to yield an acceptable sense of ‘Justt 
fied’.1® Second, can an internalist, especially a strong internalist, live 
with the idea that certain dispositions count as justifiers? Having or 
not having a disposition (of the requisite type) is not the sort of fact 
or condition that can be known by introspection. Thus, the proposal 
to supplement the occurrent sense of ‘justified’ with a dispositional 


sense of ‘justified’ is simply the abandonment of strong internalism. 
IL INDIRECT KNOWABILITY AND WEAK INTERNALISM 


The obvious solution to the problem of stored beliefs is to relax the 
KJ constraint: allow justifiers to be merely indirectly knowable. ‘This 
yields: 


KJ na: Lhe only facts that qualify as justifiers of an agent's believing p at 
time tare facts that the agent can readily know at 4, either directly 
or indirectly, to obtain or not to obtain. 


The danger here is that indirect knowledge might let in too much 
from an internalist perspective. How are externalist forms of knowl 
edge—for example, perceptual knowledge—to be excluded? Clearly, 
internalism must propose specific forms of knowledge that conform 
with its spirit. It is fairly clear how internalism should deal with the 
problem of stored beliefs: simply allow knowledge of justifiers to in- 
clude memory retrieval. Stored evidence beliefs can qualify as justifiers 
because the agent can know that they obtain by the compound route of 
first retrieving them from memory and then introspecting their con 
scious contents. This yields the following variant of the KJ constraint 


KJ etree: Phe only facts that qualify as justrfiers of an agent’s believing p 
at time tare facts that the agent can readily know, at t to obtain 
or not to obtain, by mntrospection and/or memory retrieval. 


This KJ constraint allows for a more viable form of internalism than 
strong internalism. We may call it weak tnternalism, and initially artic- 
ulate it through the following principle: 
WI: Only facts concerning what conscious and/or stored mental states 
an agent is in at time tare justifiers of the agent’s beliefs at t 
WI will certify the justification of many stored beliefs, because agents 
often have other stored beliefs that evidentially support them. A per- 


" Feldman t reply that there 1s an important distincuon between memorial 
and perceptual itions, but it 1s not clear on what bams a principled distinc- 
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son who believes that Washington, D.C. is the capital of the United 
States may have a stored belief to the effect that a map of the U.S. he 
recently consulted showed Washington as the capital. The latter 
stored belief is what justifies the former one. So weak internalism is 
not plagued with the problem of stored justified beliefs. Weak inter- 
nalism seems to be a legitimate form of internalism because even 
stored beliefs qualify, intuitively, as internal states. 

Although weak internalism is better than strong internalism, it too 
faces severe problems. First is the problem of forgotten evidence.!* Many 
justified beliefs are ones for which an agent once had adequate evi 
dence that she subsequently forgot. At the time of epistemic ap- 
praisal, she no longer possesses adequate evidence that is retrievable 
from memory. Last year, Sally read a story about the health benefits 
of broccoli in the “Science” section of the New York Times. She then 
justifiably formed a belief in broccoli’s beneficial effects. She still re- 
tains this belief but no longer recalls her original evidential source 
(and has never encountered either corroborating or undermining 
sources). Nonetheless, her broccoli belief is still justified, and, if 
true, qualifies as a case of knowledge. Presumably, this is because her 
past acquisition of the belief was epistemically proper. But past ac- 
quisition is irrelevant by the lights of internalism (including weak in- 
ternalism), because only her current mental states are justifiers 
relevant to her current belief. All past events are “external” and 
therefore irrelevant according to internalism. 

It might be replied that Sally does currently possess evidence in 
support of her broccoli belief. One of her background beliefs, we 
may suppose, is that most of what she remembers was learned in an 
epistemically proper manner. So does she not, after all, now have 
grounds for the target belief? Admittedly, she has some evidence, but 
is this evidence sufficient for justification? Surely not. In a variant 
case, suppose that Sally still has the same background belief— 
namely, that most of what she remembers was learned in an epistem- 
ically proper manner—but she in fact acquired her broccoli belief 
from the National Inqutrer rather than the New York Times. So her 
broccoli belief was never acquired, or corroborated, in an epistemi- 
cally sound manner. Then even with the indicated current back- 
ground belief, Sally cannot be credited with justifiably believing that 
broccoli is healthful. Her past acquisition is still relevant, and dect 


* This sort of problem 1s discussed by Gilbert Harman, Change m View (Cam 
bridge: MIT, 1986); Thomas Senor, “Internalist Foundationalism and the Justifica- 
tion of Memory Belief,” Synthese, xctv (1998): 453-76; and Audi, “Memonal 


Justification,” Philosophacal Topecs, xxi (1995): 31-45. 
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sive. At least it is relevant so long as we are considering the “epistem- 
izing” sense of justification, in which justification carries a true belief 
a good distance toward knowledge. Sally’s belief in the healthfulness 
of broccoli is not justified in that sense, for surely she does not know 
that broccoli is healthful given that the National Inquirer was her sole 
source of information. 

The category of forgotten evidence is a problem for weak internal 
ism because, like the problem of stored beliefs facing strong inter- 
nalism, it threatens skeptical outcomes. A large sector of what is 
ordinarily counted as knowledge are beliefs for which people have 
forgotten their original evidence. 

In reply to the problem of forgotten evidence, Steup” offers the 
following solution. An additional requirement for memorial states to 
justify a belief that p, says Steup, is that the agent have adequate evi 
dence for believing the following counterfactual: “If she had encoun- 
tered p in a questionable source, she would not have formed the 
belief that p.” Steup’s suggestion is that in the National Inqusrer vari- 
ant, Sally fails to have adequate evidence for this counterfactual, and 
that is why her broccoli belief is not justified. My response to this 
proposal is twofold. First, the proposed requirement is too strong to 
impose on memorially justified belief. It is quite difficult to get ade- 
quate evidence for the indicated counterfactual. Second, the pro- 
posed requirement seems too weak as well. Sally might have 
adequate evidence for the counterfactual but still be unjustified in 
holding her broccoli belief. She might have adequate evidence for 
the counterfactual without its being true; but if it is not true and the 
rest of the story is as I told it, her broccoli belief is not justified. So 
Steup’s internalist-style solution does not work. 

A second problem confronting weak internalism is what I call the 
problem of concurrent retrieval Principle WI says that only conscious and 
stored mental states are justifiers, but it does not say that all sets or 
conjunctions of such states qualify as justifiers.”! Presumably, which 
sets of such states qualify is a matter to be decided by reference to 
KJ attrer if a certain set of stored beliefs can all be concurrently re- 
trieved at time tand concurrently introspected, then they would pass 
the test of KJ,.4,-, and could qualify as justifiers under the principle 
of indirect knowability. But if they cannot all be concurrently re- 
trieved and introspected at 4 they would fail the test. Now it is clear 
that the totality of an agent’s stored credal corpus at a time cannot 


= His proposal was part of his commentary (see acknowledgment note above). 
=€ Obviously, one would need to reject the principle that the knowability of fact 
A and the knowabulity of fact B entail the knowability of the conjunctive fact, A & B, 
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be concurrently retrieved from memory. So that set of stored beliefs 
does not qualify as a justifier for purposes of weak internalism. Un- 
fortunately, this sort of belief set is precisely what certain types of in- 
ternalist theories require by way of a justifier. Consider holistic 
coherentism, which says that a proposition p is justified for person S 
at time ‘if and only if p coheres with S’s entire corpus of beliefs at ż 
(including, of course, the stored beliefs). A cognitive agent could as- 
certain, at 4 whether p coheres with her entire corpus only by con- 
currently retrieving all of her stored beliefs. But such concurrent 
retrieval is psychologically impossible.” Thus, the critically relevant 
justificational fact under holistic coherentism does not meet even 
the indirect knowability constraint, much less the direct knowability 
constraint. Here is a clash, then, between a standard internalist the- 
ory of justification and the knowability rationale under scrutiny. Ei- 
ther that rationale is indefensible, or a familiar type of internalism 
must be abandoned at the outset. Nor is the problem confined to co- 
herentism. Internalist foundationalism might also require concur- 
rent retrieval of more basic (or low-level) beliefs than it is 


psychologically feasible to retrieve. 
IV LOGICAL AND PROBABILISTIC RELATIONS 


As these last examples remind us, every traditional form of internal- 
ism involves some appeal to logical relations, probabilistic relations, 
or their ilk. Foundationalism requires that nonbasically justified be- 
liefs stand in suitable logical or probabilistic relations to basic beliefs; 
coherentism requires that one’s system of beliefs be logically consis- 
tent, probabilistically coherent, or the like. None of these logical or 
probabilistic relations is itself a mental] state, either a conscious state 
or a stored state. So these relations do not qualify as justifiers accord- 
ing to either SI or WI. The point may be illustrated more concretely 
within a foundationalist perspective. Suppose that Jones possesses a 
set of basic beliefs at t whose contents logically or probabilistically 
support proposition p. This property of Jones’s basic beliefs—the 
property of supporting proposition p—is not a justifier under WI, for 
the property itself is neither a conscious nor a stored mental state. 
Nor is the possession of this property by these mental states another 
mental state. So WI has no way of authorizing or permitting Jones to 
believe p. Unless WI is liberalized, no nonbasic belief will be juste 
fied, which would again threaten a serious form of skepticism. 


™ The “doxastc apot aa invoked by Bonjour (op. at, pp. 101-06) seems 
to assume that this is possible, but this 1s simply an undefended assumption. Pol 
lock (op. at, p. 186) also raises the problem identified here, though in slightly dif 
ferent terms. 
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Can this problem be remedied by simply adding the proviso that 
all properties of conscious or stored mental states also qualify as jus- 
tifiers?® This proviso is unacceptably permissive for internalism. One 
property of many conscious and stored mental states is the property 
of bang caused by a reliable process, yet surely internalism cannot admit 
this archetypically externalist type of property into the class of justi- 
fiers. How should the class of properties be restricted? An obvious 
suggestion is to include only formal properties of mental states, that 
is, logical and mathematical properties of their contents. But should 
all formal properties be admitted? This approach would fly in the 
face of the knowability or accessibility constraint, which is the guid- 
ing theme of internalism. Only formal properties that are knowable 
by the agent at the time of doxastic decision should be counte- 
nanced as legitimate justifiers under internalism. Such properties, 
however, cannot be detected by introspection and/or memory re- 
trieval. So some knowing operations suitable for formal properties 
must be added, yielding a liberalized version of the KJ constraint. 

How should a liberalized KJ constraint be designed? The natural 
move is to add some selected computational operations or algo- 
rithms, procedures that would enable an agent to ascertain whether 
a targeted proposition p has appropriate logical or probabilistic rela- 
tions to the contents of other belief states he is in. Precisely which 
computational operations are admissible? Again, problems arise. 
The first is the problem of the doxastic decision interval 

The traditional idea behind internalism is that an agent is justified 
in believing p at time t if the evidential beliefs (and perhaps other, 
nondoxastic states) possessed at t have an appropriate logical or 
probabilistic relation to p. In short, justification is conferred simulta 
neously with evidence possession. Feldman makes this explicit “For 
any person S and proposition p and time t, S epistemically ought to 
believe p at t if and only if p is supported by the evidence S has at t” 
(op. cit., p. 254). Once the knowability constraint is introduced, how- 
ever, simultaneous justification looks problematic. If justification is 
contingent on the agent's ability to know what justifiers obtain, the 
agent should not be permitted to believe a proposition p at t unless 
she can know by t whether the relevant justifiers obtain. Since it nec- 
essarily takes some time to compute logical or probabilistic relations, 
the simultancity model of justification needs to be revised so that an 
agent's mental states at t justify her in believing only p at t + €, for 


= More precisely, the contemplated proviso should say that the possession of 
any property by a mental state (or set of mental states) qualifies as a justifier. This 
reading will be understood wherever the text talks loosely of “properties.” 
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some suitable c. The value of € cannot be too large, of course, lest 
the agent’s mental states change so as to affect the justificational sta- 
tus of p. But e must be large enough to allow the agent time to deter- 
mine the relevant formal relations. 

These two conditions—(1) avoid mental change, but (2) allow 
enough time to compute formal relations—may well be jointly unsatis- 
fiable, which would pose a severe problem for internalism. Mental 
states, including perceptual states that generate new evidence, change 
very rapidly and they could easily change before required computa- 
tions could be executed. On the other hand, although mental states 
do change rapidly, the agents belief system might not be epistemically 
required to reflect or respond to each change until interval e has 
elapsed. Some doxastic decision interval, then, might be feasible. 

Is there a short enough decision interval during which justifica 
tionally pertinent formal properties can be computed? Coherentism 
says that S is justified in believing proposition p only if p coheres with 
the rest of S’s belief system held at the time. Assume that coherence 
implies logical consistency. Then coherentism requires that the logi 
cal consistency or inconsistency of any proposition p with S's belief 
system must qualify as a justifier. But how quickly can consistency or 
inconsistency be ascertained by mental computationr As Christopher 
Cherniak™ points out, determination of even tautological consistency 
is a computationally complex task in the general case. Using the 
truth-table method to check for the consistency of a belief system 
with 138 independent atomic propositions, even an ideal computer 
working at “top speed” (checking each row of a truth table in the 
time it takes a light ray to traverse the diameter of a proton) would 
take twenty billion years, the estimated time from the “big-bang” 
dawn of the universe to the present. Presumably, twenty billion years 
is not an acceptable doxastic decision interval! 

Any reasonable interval, then, is too constraining for garden-variety 
coherentism. The knowability constraint again clashes with one of 
the stock brands of internalism.” Dyedin-the-wool internalists might 
be prepared to live with this result. “So much the worse for tradi 
tional coherentism,” they might say, “we can live with its demise.” 
But this does not get internalism entirely off the hook. There 
threaten to be many logical and probabilistic facts that do not qualify 


H “Computational Complexty and the Universal Acceptance of Logic,” this 
JOURNAL, LXXXI, 12 (December 1984): 739-58. 

= This computational difficulty for coherentism is identfled by Kornblith, "The 
Unattainability of Coherence,” in J. Bender, ed., The Current Stats of the Coherence 
Theory (Dordrecht: Kluwer, 1989), pp. 207-14. 
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as justifiers because they require too long a doxastic interval to com- 
pute. Furthermore, it is unclear what is a principled basis for decid- 
ing what is too long. This quandary confronting internalism has 
apparently escaped its proponents’ attention. 

A second problem for logical and probabilistic justifiers is the 
availability problem. Suppose that a particular set of computational oper- 
ahons—call it COMP—is provisionally selected for inclusion along- 
side introspection and memory retrieval. COMP might include, for 
example, a restricted (and hence noneffective) use of the truth-table 
method, restricted so as to keep its use within the chosen doxastic 
decision interval.” This yields a new version of the KJ constraint: 


KJ nt+reetcomr: Lhe only facts that qualify as justfers of an agent’s be- 
lieving p at time ¢ are facts that the agent can readily 
know within a suitable doxastic decision interval via mm- 


trospection, memory retneval, and/or COMP. 


Now, the KJ constraint is presumably intended to apply not only to the 
cleverest or best-trained epistemic agents but to all epistemic agents, in- 
cluding the most naive and uneducated persons on the street. After all, 
the point of the knowability constraint is that justifiers should be facts 
within the purview of every epistemic agent. Under the GD conception, 
compliance with epistemic duty or responsibility is not intended to be 
the private preserve of the logical or mathematical elite. It is something 
that ought to be attained—and should therefore be attainable—by any 
human agent. The truthtable method, however, does not seem to be 
in the intellectual repertoire of naive agents, so it is legitimate to in- 
clude COMP operations within a KJ constraint. Unlike introspection 
and memory retrieval, it is not available to all cognitive agents. 

It may be replied that computational operations of the contem- 
plated sort would be within the capacity of normal human agents. No 
super-human computational powers are required. Computing power, 
however, is not the issue. A relevant sequence of operations must also 
be available in the agent’s intellectual repertoire; that is, she must 
know which operations are appropriate to obtain an answer to the 
relevant (formal) question.” Since truth-table methods and other 


= Because of the contemplated restriction, there will be many questions about 
formal facts to which COMP cannot deliver answers. Thus, formal facts that might 
otherwise qualify as justifiers will not so qualify under the version of the KJ con- 
straint that incorporates COMP. 

© Propositional (or "declaratrve") knowledge of the appropriate sequence of op- 
erations is, perhaps, an unduly restrictive requirement. It would suffice for the 
agent to have “procedural” skills of the right sort. But even such skills will be lack- 


ing in naive cognitive agents. 
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such algorithms are probably not in the repertoire of ordinary cogni 
tive agents, they cannot properly be included in a KJ constraint. 

A third problem concerns the proper methodology that should be 
used in selecting a KJ constraint that incorporates computational op- 
erations. As we see from the first two problems, a KJ constraint that 
conforms to the spirit of the GD rationale must reflect the basic cog- 
nitive skills or repertoires of actual human beings. What these basic 
repertoires consist in, however, cannot be determined a priori. It 
can only be determined with the help of empirical science. This fact 
fundamentally undermines the methodological posture of internal 
ism, a subject to which I shall return in section Vu. 

Unul now, I have assumed a unrversal accessibility constraint, one 
that holds for all cognitive agents. But perhaps potential justifiers for 
one agent need not be potential justifiers for another. Justifiers might 
be allowed to vary from agent to agent, depending on what is know 
able by the particular agent. If two agents have different logical or 
probabilistic skills, then some properties that do not qualify as justi 
fiers for one might yet qualify as justifiers for the other. Indeed, the 
constraint KJ ne+re:+comp Might be read in precisely this agent-relativized 
way. The subscripts may be interpreted as indicating knowledge routes 
that are available to the agent mn question, not necessarily to all agents. 

If KJ constraints are agent relativized as a function of differences in 
knowledge skills, this means that two people in precisely the same evi 
dential state (in terms of perceptual situation, background beliefs, and 
so on) might have different epistemic entitlements. But if the two 
agents are to comply with their respective epistemic duties, each must 
know which knowledge skills she has. This simply parallels the second 
step of the internalist’s original three-step argument. If one’s epis- 
temic duties or entitlements depend on one’s knowledge skills (for ex- 
ample, on one’s computational skills), then compliance with one’s 
duties requires knowledge of which skills one possesses. There are two 
problems with this approach. First, it is unlikely that many people—es- 
pecially ordinary people on the street—have this sort of knowledge, 
and this again threatens large-scale skepticism. Second, what is now re- 
quired to be known by the agent is something about the truth-getiing 
power of her cognitive skills—that is, the power of her skills in detect- 
ing justifiers. This seems to be precisely the sort of external property 
that internalists regard as anathema. How can they accept this solution 
while remaining faithful to the spirit of internalism?* 


= It might be argued that internalism’s spirit leads to a pee is pa Sie 
even for universal versions of a KJ constraint, not just for agen i versions. 
Perhaps so; but so much the worse for the general form of internalism. 
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V. EPISTEMIC PRINCIPLES 

When the KJ constraint speaks of justifiers, it is not clear exactly 
what these comprehend. Specifically, do justifiers include epistemic 
principles themselves? I believe that principles should be included, 
because epistemic principles are among the items that determine 
whether or not an agent is justified in believing a proposition, which 
is Just how ‘justifiers’ was defined. Furthermore, true epistemic prin- 
ciples are items an agent must know if she is going to determine her 
epistemic duties correctly. Knowledge of her current states of mind 
and their properties will not instruct her about her epistemic duties 
and entitlements unless she also knows true epistemic principles. 

How are epistemic principles to be known, according to internal- 
ism? Chisholm” says that central epistemic principles are normative 
supervenience principles, which (when true) are necessarily true. 
Since they are necessary truths, they can be known a priort—in par- 
ticular, they can be known “by reflection.” 


The internalist assumes that, merely by reflecting upon his own con- 
scious state, he can formulate a set of epistemic principles that will en- 
able him to find out, with respect to any possible belief he has, whether 
he is justified in having that belief.™ 
This passage is ambiguous as to whether (correct) epistemic princi 
ples are accessible on reflection just to epistemologists, or accessible 
to naive epistemic agents as well. The latter, however, must be re- 
quired by internalism, because justifiers are supposed to be deter- 
minable by all epistemic agents. 

Are ordinary or naive agents really capable of formulating and rec- 
ognizing correct epistemic principles? This seems highly dubious. 
Even many career-long epistemologists have failed to articulate and 
appreciate correct epistemic principles. Since different epistemolo- 
gists offer disparate and mutually conflicting candidates for epis- 
temic principles, at most a fraction of these epistemologists can be 
right. Perhaps none of the principles thus far tendered by epistemol- 
ogists is correct! In light of this shaky and possibly dismal record by 
professional epistemologists, how can we expect ordinary people, 
who are entirely ignorant of epistemology and its multiple pitfalls, to 
succeed at this task?”! Nor is it plausible that they should succeed at 
this task purely “by reflection” on their conscious states, since among 
the matters epistemic principles must resolve is what computational 


™ “The Status of Epistemic Principles,” Mots, xav (1990): 2090-15. 

= Theory of Knowledge, 3rd edition, p 76; emphasis omitted. 

* A simular worry is expressed by Alston in “Internalism and Externalism in Epu- 
temology,” pp. 221-22. 
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skills are within the competence of ordinary cognizers. I do not see 
how this can be answered a priori, “by reflection.” 

A crippling problem emerges for internalism. If epistemic princi 
ples are not knowable by all naive agents, no such principles can 
qualify as justifiers under the KJ constraint. If no epistemic princi 
ples so qualify, no proposition can be justifiably believed by any 
agent. Wholesale skepticism follows. 

VI THE CORE DILEMMA FOR THE THREE-STEP ARGUMENT 

I raise doubts here about whether there is any cogent inferential 
route from the GD conception to internalism via an acceptable KJ 
constraint. Here is the core dilemma. The minimal, unvarnished ver- 
sion of the KJ constraint does not rationalize internalism. That sim- 
ple constraint merely says that justifiers must be readily knowable, 
and some readily knowable facts might be external rather than inter- 
nal. If all routes to knowledge of justifiers are allowed, then knowl 
edge by perception must be allowed. If knowledge by perception is 
allowed, then facts of an external sort could qualify for the status of 
justifiers. Of course, no epistemologist claims that purely external 
facts should serve as justifiers. But partly external facts are nomi 
nated by externalists for the rank of justifiers. Consider properties of 
the form: being a reliable perceptual indicator of a certain environ- 
mental fact. This sort of property is at least partly external because 
reliability involves truth, and truth (on the usual assumption) is ex- 
ternal. Now suppose that a certain auditory perceptual state has the 
property of being a reliable indicator of the presence of a mourning 
dove in one’s environment. Might the possession of this reliable indi 
catorship property qualify as a justifier on the grounds that it is in- 
deed readily knowable? If every route to knowledge is legitimate, I 
do not see how this possibility can be excluded. After all, one could 
use past perceptions of mourning doves and their songs to deter- 
mine that the designated auditory state is a reliable indicator of a 
mourning dove’s presence. So if unrestricted knowledge is allowed, 
the (partly) external fact in question might be perfectly knowable. 
Thus, the unvarnished version of the KJ constraint does not exclude 
external facts from the ranks of the justifiers. 

The simple version of the KJ constramt, then, does not support in- 
ternalism. Tacit recognition of this is what undoubtedly leads inter- 
nalists to favor a “direct” knowability constraint Unfortunately, this 
extra rider is not rationalized by the GD conception. The GD con- 
ception at best implies that cognitive agents must know what justi- 
fiers are present or absent. No particular types of knowledge, or paths 
to knowledge, are intimated. So the GD conception cannot rational- 
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ize a restrictive version of the KJ constraint that unambiguously 
yields internalism. 

Let me put the point another way. The GD conception implies 
that justifiers must be readily knowable, but are internal facts always 
more readily knowable than external facts? As discussed earlier, proba- 
bilistic relations presumably qualify as internal, but they do not seem 
to be readily knowable by human beings. An entire tradition of psy- 
chological research on “biases and heuristics” suggests that naive 
agents commonly commit probabilistic fallacies, such as the “con- 
junction fallacy,” and use formally incorrect judgmental heuristics, 
such as the representativeness heuristic and the anchoring-and-ad- 
justment heuristic.” If this is right, people’s abilities at detecting 
probabilistic relationships are actually rather weak. People’s percep- 
tual capacities to detect external facts seem, by contrast, far superior. 
The unqualified version of the KJ constraint, therefore, holds little 
promise for restricting all justifiers to internal conditions in prefer- 
ence to external conditions, as internalism requires.” 

Vil THE METHODOLOGY OF EPISTEMOLOGY: EMPIRICAL OR A PRIOR? 
Internalism standardly incorporates the doctrine that epistemology 
is a purely a priori or armchair enterprise rather than one that needs 
help from empirical science. Chisholm puts the point this way: 


The epistemic principles that [the epistemologist] formulates are prin- 
ciples that one may come upon and apply merely by sitting in one’s 
armchair, so to speak, and without calling for any outside assistance. In 
a word, one need only consider one’s own state of mind.* 


Previous sections already raised doubts about the merits of apriorism 
in epistemology, even in the context of the theoretical architecture 
presented here. I now want to challenge the viability of apriorism in 
greater depth. 

Assume that, despite my earlier reservations, an internalist restric- 
tion on justifiers has somehow been derived, one that allows only 
conscious states and certain of their nonexternal properties to serve 


™ See Amos Tversky and Daniel Kahneman, “Judgment under Uncertainty: 
Heuristics and Biases,” in Kahneman, P. Slovic, and Tversky, eds., Aidgmeni under 
Uncertarnty (New York: Cambridge, 1982), pp. 3-20; and Tversky and Kahneman, 
“Extensional versus Intuitrve Reasoning: The Conjunction Fallacy m Probability 
Judgment,” Prpchological Reotew, XC (1983) 293-315. 

™ It is not really clear, moreover, why logical or probabilistic facts intuitively 
count as “internal” facts. They certainly are not internal in the same sense in which 
mental states are internal This uw an additional problem about the contours of in- 
ternaliam. 


* Theory of Knowledge, 8rd edition, p 76. 
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as justifiers. How should the epistemologist identify particular com 
scious states and properties as justifiers for specific propositions (or 
types of propositions) In other words, how should specific epistemic 
principles be crafted? Should the task be executed purely a priori, or 
can scientific psychology help? 

For concreteness, consider justifiers for memory beliefs. Suppose 
an adult consciously remembers seeing, as a teenager, a certain 
matinee idol. This ostensible memory could have arisen from imag- 
ination, since he frequently fantasized about this matinee idol and 
imagined seeing her in person. What clues are present in the cur- 
rent memory impression by which he can tell whether or not the 
recollection is veridical? This is precisely the kind of issue which in- 
ternalist epistemic principles should address. If there are no differ- 
ences in features of memory states that stem from perceptions of 
real occurrences versus features of states that stem from mere 
imagination, does this not raise a specter of skepticism over the do- 
main of memory? If there are no indications by which to distin- 
guish veridical from nonveridical memory impressions, can we be 
justified in trusting our memory impressions? Skepticism aside, 
epistemologists should surely be interested in identifying the fea- 
tures of conscious memory impressions by which people are made 
more or less justified (or prima facie justified) in believing things 
about the past. 

Epistemologists have said very little on this subject. Their discus- 
sions tend to be exhausted by characterizations of memory impres- 
sions as “vivid” or “nonvivid.” There is, I suspect, a straightforward 
reason for the paucity of detail. It is extremely difficult, using purely 
armchair methods, to dissect the microfeatures of memory expert 
ences so as to identify telltale differences between trustworthy and 
questionable memories. On the other hand, empirical methods have 
produced some interesting findings, which might properly be in- 
fused into epistemic principles in a way entirely congenial to inter- 
nalism. Important research in this area has been done by Marcia 
Johnson and her colleagues.” I shall illustrate my points by brief ref- 
erence to their research. 

Johnson calls the subject of some of her research reality monitoring. 
She tries to characterize the detectable differences between (con- 
scious) memory traces derived from veridical perception of events ver- 


™ See Johnson and Carol Raye, “Reality Monitoring,” Psychological Review, 
LXXXV (1981): 67-85; and Johnson, Mary Foley, Aurora Suengas, and Raye, “Phe- 
nomenal Characteristics of Memories for Perceived and Imagined Antobiographi- 
cal Events,” journal of Experimental Prypchology: General, cxvn (1988): 371-76. 
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sus memory traces generated by mere imaginations of events.” John 
son and Raye (op. at) propose four dimensions along which memory 
cues will typically differ depending on whether their origin was percep- 
tual or imaginative. As compared with memories that originate from 
imagination, memories originating from perception tend to have (1) 
more perceptual information (for example, color and sound), (2) 
more contextual information about time and place, and (3) more 
meaningful detail. When a memory trace is rich along these three di- 
mensions, this is evidence of its having originated through perception. 
Memories originating from imagination or thought, by contrast, tend 
to be rich on another dimension: they contain more information about 
the cognitive operations involved in the original thinkings or imagin- 
ings (for example, effortful attention, image creation, or search). Per- 
ception is more automatic than imagination, so a memory trace that 
originates from perception will tend to lack attributes concerning ef- 
fortful operations. Johnson and Raye therefore suggest that differences 
in average value along these types of dimensions can form the basis for 
deciding whether the origin of a memory is perceptual or nonpercep- 
tual. A memory with a great deal of visual and spatial detail, and with- 
out records of intentional constructive and organizational processes, 
should be judged to have been perceptually derived.” 

Epistemologists would be well-advised to borrow these sorts of 
ideas and incorporate them into their epistemic principles. A person 
is (prima facie) justified in believing in the real occurrence of an os- 
tensibly recalled event if the memory trace is strong on the first 
three dimensions and weak on the fourth dimension. Conversely, an 
agent is unjustified in believing in the real occurrence of the re- 
called event if the memory trace is strong on the fourth dimension 
but weak on the first three dimensions. All of these dimensions, of 
course, concern features of conscious experience. For this reason, 
internalist epistemologists should be happy to incorporate these 
kinds of features into their epistemic principles. 

Let me distinguish two categories of epistemologically significant 
facts about memory experience which empirical psychology might 


™ Memory errors are not confined, of course, to confusions of actual with imag- 
ined events. There are also errors that arise from confusing, or blending, two ac- 
tual events. But this research of Johnson's focuses on the actual/nonactual (or 

ved versus magined) problem. 

* They also recognize that people can compare a target memory with memories 
of contextually related events to assess the target’s verndicality. This kind of “coher- 
ence” factor is a stockin-trade of epistemology, however, and hence not a good ex- 
ample of the distunctive contributions psychology can make to this subject. | 
therefore pass over it 
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provide. First, as we have seen, it might identify types of representa- 
tional materials which are generally available in people’s memory ex- 
periences. Second, it might indicate which of these representational 
materials are either reliable or counter-reliable indicators of the 
veridicality of the ostensibly recalled events. Is the reliability of a 
memory cue a legitimate issue from an internalist perspective? It 
might be thought not, since reliability is usually classed as an exter- 
nal property. But epistemologists might use reliability considerations 
to decide which memory characteristics should be featured in epis- 
temic principles. They need not insert reliability per se into the prin- 
ciples. There is nothing in our present formulation of internalism, at 
any rate, which bars the latter approach. Any KJ constraint provides 
only a necessary condition for being a justifier; it leaves open the 
possibility that additional necessary conditions, such as reliable indi- 
cation, must also be met. Indeed, many internalists do use reliability 
as a (partial) basis for their choice of justifiers. BonJour (op. at, p. 
7) says that the basic role of justification is that of a means to truth, 
and he defends coherence as a justifier on the ground that a coher- 
ent system of beliefs is likely to correspond to reality. This point 
need not be settled definitively, however. There are already ade- 
quate grounds for claiming that internalism cannot be optimally 
pursued without help from empirical psychology, whether or not re- 


liability is a relevant consideration. 
VII. CONCLUSION 


Let us review the parade of problems infecting internalism which we 
have witnessed, though not all in their order of presentation. (1) 
The argument from the GD conception of justification to internal 
ism does not work. Internalism can be derived only from a suitably 
qualified version of the KJ constraint because the unqualified ver- 
sion threatens to allow external facts to count as justifiers. No suit- 
ably qualified version of the KJ constraint is derivable from the GD 
conception. (2) A variety of qualified KJ constraints are possible, 
each leading to a different version of internalism. None of these ver- 
sions is intuitively acceptable. Strong internalism, which restricts jus- 
tifiers to conscious states, is stuck with the problem of stored beliefs. 
Weak internalism, which allows stored as well as conscious beliefs to 
count as justifiers, faces the problem of forgotten evidence and the 
problem of concurrent retrieval. (3) The question of how logical 
and probabilistic facts are to be included in the class of justifiers is 
plagued by puzzles, especially the puzzle of the doxastic decision in- 
terval and the issue of availability. (4) Epistemic principles must be 
among the class of justifiers, but such principles fail internalism’s 
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knowability requirement. (5) The favored methodology of internal 
ism—the armchair method—cannot be sustained even if we grant 
the assumption that justifiers must be conscious states. 

Internalism is rife with problems. Are they all traceable to the GD 
rationale? Could internalism be salvaged by switching to a different 
rationale? A different rationale might help, but most of the prob- 
lems raised here arise from the knowability constraint. It is unclear 
exactly which knowability constraint should be associated with inter- 
nalism, and all of the available candidates generate problematic the- 
ories. So I see no hope for internalism; it does not survive the glare 


of the spotlight. 
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CALIBRATION AND THE EPISTEMOLOGICAL ROLE OF 
BAYESIAN CONDITIONALIZATION * 


uppose that a scientist acquires some new evidence; she learns 

that e obtains. She must update her confidence in hypothesis A 

accordingly. Bayesian conditionakzation (BC) holds that her new 
confidence in A should equal the confidence that she had (prior to 
discovering ¢) that A obtains in the event that ¢ obtains. More pre- 
cisely, if pry,(-) is our probability distribution before discovering e, 
and pr,(°) is our probability distribution after discovering a then 
BC says that: 


IfQ < pry,(A) < 1 (so that before discovering e we regarded A asa 
live hypothesis, subject to confirmation and disconfirmation) 
Prale) < 1 (so that a was not believed already) 
s is the strongest claim whose truth we take ourselves to have discov 
ered 
and pray(¢) > 0 (so that pru(Ale) = Praal h & 6) / Praal À) 
then Di gee (A) = Praa (Ale). 


Coupled with Bayes’s theorem—pryj(Ale) = ptug(A) Praalelh)/ Praal- 
BC seems able to account elegantly for many important features of 
theory confirmation in science, such as the value of diverse evidence, 
the kernel of correctness in hypothetico-deductivism, the roles of 
auxiliary hypotheses and rival hypotheses and background beliefs, 
and the failure in certain cases of hypotheses to be confirmed by 
their positive instances.’ 

In section I, I review certain considerations often mentioned as pos- 
ing difficulties for BC. Curiously, several of these purported challenges 
involve cases that, strictly speaking, fall outside BC’s jurisdiction. Why, 
then, are these considerations widely perceived as making trouble for 
BC? At least partly because (I suggest) we expect BC to capture not 
merely our intuitions regarding how we should revise our opinions (that 
is, update our degrees of confidence in various claims) upon discover- 
ing new evidence, but also our intuitions regarding whether ¢ counts as 
evidence for h; we tend to regard BC as purporting to explain why vari 


* My thanks to Jeff Barrett, Jum Bogen, Mitch Green, Alan Hajek, Michael Ot- 
suka, Ric Otte, Jim Woodward, and Lyle Zynda for discussions of carly drafts, and 
to Bill Talbott for more recent discussion. 

' Many authors have argued that BC can account for these features. For discus- 
sion, see John Earman, Bayas or Bust? (Cambridge: MIT, 1992), chapter 3 (“Success 
Stories”). A careful examination of these arguments and their potential defects is 
beyond the scope of this paper. 
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ous factors make e stronger or weaker evidence for or against A. But 
(as we shall see) the notion of ¢’s being ‘evidence for A’ is not restricted 
to cases where we discover e 0 < Praal < 1,0 < Praalh) < 1, and we 
update our confidence in h accordingly. So even if BC governs every 
such updating, it does not thereby automatically account for every case 
in which ¢constitutes evidence for (or against) A. 

In section U, I distinguish two roles that BC might play in an ac- 
count of confirmation: as governing how we should update our opin- 
ions upon discovering new evidence, and as governing the steps in 
the arguments by which our current opinions are to be justified by the 
evidence that we have already assembled. The former is a diachronic 
matter; the latter concerns the arguments that we give at a particular 
moment for our opinions then. Although this is an elementary dis- 
tinction, I have not seen it drawn elsewhere. This distinction 1s use- 
ful, I argue, because if BC governs justificatory arguments, then it is 
in the right position to cash out what it is for ¢ to be evidence for (or 
against) A. In this role, BC could account for the various features of 
confirmation that it seems able to explicate so nicely, and yet avoid 
the problems that result from trying to explicate ‘eis evidence for f’ 
in terms of the difference that e's discovery makes, made, or would 
have made to our confidence in A.” 

In section m, I consider why it would be wrong for a justificatory ar- 
gument to violate BC-what sort of logical error would thereby be 
made. Diachronic Dutch Book arguments have sometimes been 
thought to explicate why it would be wrong for us to violate BC in up- 
dating our opinions. But these arguments have encountered formi 
dable obstacles. Indeed, for reasons I discuss in section I, I think that 
it is rationally permissible for an update to violate BC. I use the notion of 
calibration to elaborate the sense in which it would be wrong for a fus- 
tificatory argument to violate BC. I demonstrate that a scientist’s opin- 
ions at two steps of a justificatory argument can all be calibrated (in 
the relevant sense) only if her transition from one step to the next ac- 
cords with BC. This calibration argument, I contend, avoids difficul 
ties encountered by diachronic Dutch Book arguments. 

In applying BC to justificatory arguments rather than to di- 
achronic updates, I aim to find a role for BC to play where it could 
account for various important features of confirmation. 


? I do not contend that all serious obstacles to reconstructing scientific practice in 
accord with BC can be avoided in this way. In particular, logical omnuscience (which 
I assume throughout) and the problem of the priors require separate treatment 
Also, I presume throughout that upon making an observation, we accept a claim 
whose content expresses our new evidence, and that any evidence presented m a jus- 
trficatory argument consists of a proposition whose truth has been ascertained. 
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I 
I shall begin by briefly reviewing four alleged problems for BC and 
trying to understand just why they are supposed to be problems. 

First, consider the problem of old evidence. Suppose that A was 
formulated only after e was discovered. Since we already believe e by 
the time we formulate A, our pr,,(4) at that time is 1, so pry, (elA) = 1, 
so (by Bayes’s theorem) praalhle) = pruy(A), and so (by BC) pr, (A) 
= prya(h). Yet we may well regard eas evidence for (or against) h. 

So the problem of old evidence is typically posed.’ But BC applies 
only when, having already formulated h, we discover e and must up- 
date our confidence in h accordingly; recall that BC is specifically re- 
stricted to cases where pry,(¢) < 1. When ¢ is old evidence, there is 
no moment at which g's discovery elicits a shift in our confidence in 
h, since there is no pr,,(A), prior to discovering e from which to 
shift. Old evidence therefore falls outside of BC’s official scope. 
This makes it unclear why old evidence is commonly perceived as 
posing a problem for BC. 

I suggest that one reason why old evidence is taken to be a prob- 
lem for BC is because BC is intended to account for the various con- 
siderations that may make e evidence for A. Old evidence is a 
problem for this project because intuitively, old evidence can be evt- 
dence for h, even though old evidence cannot provoke our updating 
our confidence in A, and so BC apparently tells us nothing about 
what makes some old evidence e stronger or weaker evidence for or 
against A. In other words, although BC applies only to updatings, 
Bayesians purport to be giving an analysis of what it is for evidence to 
“count for or against a theory, or be neutral towards it,” as Colin 
Howson and Peter Urbach‘ put it, it is quite common to treat ‘eis ev- 
idence for k’ as one way of expressing the target of confirmation the- 
ory.” But no account-Bayesian or otherwise—that applies only to 
updatings can straightforwardl explain when e counts as evidence for 
h, since not all evidence for h provokes, upon its discovery, an in- 
crease in our confidence in A—as when eis discovered before we have 
even formulated A. Thus, old evidence is a problem for BC. 

One way to make an account of updating explain when old evidence 
e counts as evidence for h, even though our confidence in A is not actu- 


ally updated upon our discovering «, is to apply the account of updat- 


* For a good review of the literature, see Earman, pp. 119-36. 

4 Scsnufic Reasoning: The Bayesian Approach (Chicago: Open Court, 1998, 2nd 
edition ), p. 117 

* See, for instance, Paul Horwich, Probability and Evidence (New York: Cambndge, 
1982), p. 51; and Mark Kaplan, Deaston Theory as Philosophy (New York: Cambridge, 
1996), p. 48. 
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ing to a counterfactual updating: how we should have shifted our confi- 
dence in A upon discovering ¢, had we formulated A before discovering 
e. This appeal to counterfactual updatings has been extensively criti- 
cized, though not all find these criticisms persuasive. I cannot now 
give this proposal the scrutiny it deserves, but I need to mention one of 
its chief difficulties. What our pr,,(A) would have been, had we formu- 
lated A sometime before discovering ¢ depends upon the background 
beliefs that we held before discovering e regarding auxiliary hypotheses 
and alternatives to h. These background beliefs may be quite foreign 
to us when we judge, long after e was discovered, whether e is evidence 
for h. Consequently, such historical considerations seem out of 
place-intuitively irrelevant to 2's bearing on A. 

Of course, we might avoid this problem by applying BC not to the 
probability distribution that we would have held prior to discovering 
é, had Aalready been formulated by then, but to some other probabil 
ity distribution, which is not picked out by some exercise in counter- 
factual history. Presumably, by proper selection of this reference 
probability distribution, BC could be made to yield verdicts that agree 
with our intuitions. But we might well wonder why e's status now as 
evidence for h depends upon the way that we should have updated 
our opinions upon discovering ¢ were we beginning from a certain 
probability distribution that is neither our actual distribution at some 
past moment nor even the distribution that we would have held then, 
had A been formulated by then. I shall later suggest that the refer- 
ence probability distribution is relevant because it appears in the ar- 
gument by which we justify our current degree of confidence in A. 
Therefore, whether e is evidence for 4 depends not on some hypo- 
thetical updating, but on our current justificatory arguments. 

If we cash out ‘eis evidence for A’ in terms of some actual or counter- 
factual updating, depending upon whether e is new or old evidence, 
then we run the risk of construing ‘e is evidence for A’ very differently 
in these two cases. Intuitively, however, the temporal order should not 
enter into it. I shall ultimately give ‘eis evidence for A’ a uniform treat- 
ment, whatever the temporal order, by elaborating it in terms of ¢’s role 
in an argument justifying our current degree of confidence in h. Actu- 
ally, even when e was discovered after h was formulated but nevertheless 
some time ago, we should not construe ‘eis evidence for A’ in terms of 
the actual updating that was then provoked by g's discovery.” Whether e 


* For cntique, see Earman, pp. 123, 134, and the references cited there For de- 
fense, see, for example, Howson and Urbach, pp. 403-08. 
” Ellery Eells terms this e ‘old new evidence’ in his “Problems of Old Evidence,” 
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now counts as evidence for h does not depend on whether at the time it 
was discovered, ¢ provoked an updating that raised our confidence in A. 
That past updating reflected our opinions then regarding various auxil- 
iary and rival hypotheses, whereas it is our current opinions that infhr 
ence whether e now counts as evidence for h. (Of course, whether e 
now counts as evidence for h cannot be fixed by whether, right now, 
pr(hle) > pr(A), since e was discovered some time ago, so these two 
probabilities are equal.) Again, I shall suggest that ‘eis evidence for A’ 
should be understood in terms of ¢s role in arguments by which we 
now justify our current degree of confidence in A. 

The second of the four alleged problems for BC that I wish to dis- 
cuss is that, if Ais already believed (that is, pry,(4) = 1), then discov- 
ery of some new evidence e cannot confirm A (that is, cannot raise 
our confidence in A) under BC (or any other updating rule that re- 
spects the axioms of probability). Likewise, e cannot disconfirm A if 
—h is already believed (that is, pr,(h4) = 0). Again, although this is 
reputed® to be a problem for BC, it is initially unclear just why. Like 
old evidence, this case falls outside BC’s jurisdiction, since BC re- 
quires 0 < Praa (h) < 1. One reason it poses a problem for BC, I sug- 
gest, is again that we expect BC to provide an analysis of ʻe is 
evidence for /', and this project is threatened when e intuitively sup- 
ports (or is evidence against) A and yet ¢’s discovery fails to result in 
our updating our opinion of h. Once again, I shall shortly suggest 
that the fault lies not with BC, but with using it to cash out ‘e is evi 
dence for k in terms of updating rather than in terms of an argu 
ment justifying our current opinion of A. 

The third alleged problem for BC is a sort of dual to the second: if 
h is already believed (that is, pry,(4) = 1), then according to BC, e's 
discovery cannot disconfirm 4—cannot lower our confidence in h—and 
if ~A is already believed (that is, Praa (A) = 0), then according to BC, e 
cannot confirm A. But this is inconsistent with scientific practice; new 
evidence sometimes leads scientists to take seriously a theory that they 
had rejected or to challenge a theory that they had accepted. 

Again, however, it is not initially apparent why these considera- 
tions make any trouble for BC. As Isaac Levi’ has noted, BC pur- 
ports to cover only shifts of opinion upon discovering some e that ts 
consistent with what we already fully believe. If Praal) = 0, as when A 
logically entails ~e and pr..,(4) = 1, then our rejection of 4 upon dis 
covering ¢is entirely consistent with BC, just outside BC’s scope. For 
that matter, BC requires 0 < pry,(A) < 1, so any rise of suspicions re- 


* See, for instance, Kaplan, p 
5 a S A A 1980), p. 82. 
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garding previously accepted hypotheses or rehabilitation of previ- 
ously rejected hypotheses goes beyond BC’s jurisdiction. Such shifts 
of opinion are consistent with BC. 

One reason, I think, that such shifts of opinion are often perceived 
to pose a problem for BC is again that BC is expected to explain why e 
is strong or weak evidence for or against A. Intuitively, the factors re} 
evant to e's evidential power are the same whether A is already fully 
believed, already rejected, or being entertained as a live hypothesis. 
But an account of ‘e is evidence for A’ which uses BC to cover the 
cases where A is a live hypothesis, and some other principles for the 
other cases, threatens to treat these cases in a counterintuitively 
nonuniform way. We might well be prepared to grant that along with 
BC to cover shifts of opinion upon consistent extension of our full be- 
liefs, there are further rules covering updates that involve rehabilitat- 
ing formerly rejected hypotheses or challenging formerly accepted 
hypotheses.” These shifts of opinion are intuitively quite different 
from “business as usual.” But we may be much less willing to grant 
that an explication of ‘eis evidence for A’ should treat these cases dif- 
ferently. Intuitively, the same considerations that render ¢ powerful 
evidence against A, when 0 < Praal) < 1, should have made e power- 
ful evidence against 4 even if we had already been taking pr,,,(4) = 1. 
BC’s limited range of application thus becomes an argument against 
its appearing in an analysis of ‘eis evidence for X. 

Of course, we might try to address this problem by expanding 
BC’s scope. We might begin by making pr,,,(Ale) well defined even 
when pr,,,(¢) = 0, as Bruno de Finetti” does by holding pry,(Ale) to 
be unconstrained by unconditional probabilities when pryj(¢) = 0; 
Proa(Ale) is then no longer equal to praa(h & ¢)/proia(¢) or to 
Plog A) Plog (61h) / Proia (4) - We could then hold that even when 
Plag(¢) = 0, es evidential relevance to h is given by BC~by compar- 
ing Pra (Ale) to prgg(A). But this move alone cannot resolve the 
problem. To explain why ¢ is strong or weak evidence for or against 
h, we must still give some account of why pry,(Ale) takes on the value it 
does. Since this account fails to link Pray (Ale) to prga(A), Praal), and 
the like in the usual way, the explanation of ¢’s evidential relevance to 
h will be quite different fom BC’s explanation in cases where 0 < 
Praal) < 1 and prys(e) > 0. For example, BC is often cited as ex- 
plaining what is right about hypothetico-deductivism—why the fact that 
h logically entails ¢ can sometimes make e evidence for A: if A logically 
entails e then (when pr.,(h) > 0) Praalelh) = 1, and so pr,(h) = 

* For example, “contraction strategies” are discussed by Levi, pp. 59-62. 


! Probability, Induction, and Statistics (New York: Wiley, 1972), pp. 81-82. See 
also, for Instance, Levi, p. 219; and Earman, p. 36. 
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PT og (Ale) = Dl gg(A)/prog(e), which (when Praal) < 1) exceeds 
Proaa(4). But this argument presumes pr,,(¢) > 0; if Pral) = 0, we 
would need an utterly different account of why Pleag (Ale) > prag(A). 

The fourth alleged problem for BC that I shall raise-and, arguably, 
the most fundamental] shall call the problem of wncorngiiikty, If a scien- 
tist obeys BC after discovering ¢ and then again after discovering e’, 
her Proe (h) is her pryy(Ale & e’). Therefore, a scientist who with each 
new observation alters her opinions by BC finds herself with opinions 
that were encoded all the while as conditional probabilities in her origi- 
nal probability distribution. There is a sense in which the probabilities 
in her most prior probability distribution are incorrigible; the scientist 
is forever trapped in this distribution, moving from point to point 
within it but never revising it. She is enslaved to commitments she un- 
dertook at the inception of her doxastic career, imprisoned by priors 
adopted (as Bas van Fraassen” says) “at her mother’s knee.” That these 
commitments are immune from revision is contrary to contemporary 
epistemological views and to scientific practice. Creative insights in sci- 
ence often occur precisely when a scientist, without receiving any new 
evidence or recognizing any logical relationship that had formerly es- 
caped her, breaks out of what had been her most prior probability dis- 
tribution-say, by deciding to take seriously a hypothesis to which she 
had been giving an exceedingly small (or zero) most prior probability, 
or to regard a hypothesis as capable of being confirmed by a class of ev- 
idence that she had formerly considered irrelevant to it.” 

For instance, imagine a physicist of 1905 who is re-reading Albert 
Einstein’s new paper “On a Heuristic Viewpoint concerning the Pro- 
duction and Transformation of Light."“ Einstein examines the fact 
that Max Planck’s equation governing the black-body spectrum can 
be derived from the hypothesis that light comes in discrete quantities 
(quanta) rather than continuous waves, and proposes that this equa- 
tion’s empirical correctness be regarded as evidence for any other 
prediction that can also be derived from the light-quantum hypothe- 


= Laws and Symmetry (New York: Oxford, 1989) and “Rationality Does Not Re- 
quire Conditionalization,” in E. Ullmann-Margalit, ed., The Israel Colloqurum Senes 
tn the History, Philosophy, and Sociology of Saence, Volume V (Dordrecht Kluwer, 
forthcoming). 

i crs garage alas “grec E 
room for violations of “confirmational tenacity,” tly terms the perpetual 
grip of our most prior probability distribution (op. a p. ae . (For more on this, see 
section IL) See also aa and Alan Franklin, “Bayesian Couditonalization and 
Probability Kinematics,” British fournal for the of Scsence, XLV (1994): 451-66, 
here p. 459; Charles Chihara, “The Howson-U Proofs of Bayesian Principles,” 
in Eells and B. Skymms, eds., Probability and Conditionals (New York: Cambridge, 
1994), pp. 161-78, here pp. 171-73; and areata Fraassen (op. at). 

4 Annalen der Physik (ser. 4), xvn (1905): 182-48. 
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sis. Suppose that on his first reading of this paper, our physicist 
shared the reaction of nearly all actual physicists in 1905: if the black- 
body spectrum behaves as if light is quantized, this is sheer coinci- 
dence—“nothing more than a curious property of light, without any 
physical significance.”” It is not really a live option that light is quam- 
tized, considering the overwhelming success of the wave theory of 
light in accounting for a wide range of optical and electromagnetic 
phenomena. The light-quantum is just a formal device for deriving 
the correct black-body equation (as Planck" later reported himself to 
have held in 1905). Accordingly, it would be utterly coincidental if 
any other phenomenon were as if light is quantized. So the black- 
body equation’s success does not confirm the other equations (gov 
erning the photoelectric effect, the photoionization of gases, and 
photoluminescence) that Einstein uses the light-quantum hypothesis 
to derive. This, I said, was our physicist’s initial reaction. But sup- 
pose that on re-reading Einstein’s paper, our physicist is more im- 
pressed by certain considerations which he noted before, but which 
(he now believes) he originally failed to accord their proper weight. 
For example, he is struck more forcibly by the paper’s opening 
thought that none of the remarkable successes of the classical theory 
of light (reflection, refraction, diffraction, interference, and so on) 
concerns light absorption or emission, which are central to each of 
the phenomena with which Einstein is concerned (the black-body 
spectrum, photoelectricity, and so on). Although he knew this point 
before re-reading the paper, our physicist decides that he failed then 
to give it due regard. He therefore re-evaluates certain of his opin 
ions-for example, raising his pr(light-quantum hypothesis) and set- 
ting pr(Einstein’s photoelectric-effect equation | Planck’s black-body 
equation) > pr(Einstein’s photoelectric-effect equation), whereas he 
had formerly set these equal. He is not taking into account evidence 
(or even logical truths) that he did not know or take into account be- 
fore, nor is he inventing some new hypothesis. Rather, he regards his 
new probabilities as the nghi ones for him to assign—the ones he 
should have assigned earlier, considering what he knew then. (I shall 
elaborate this idea of rightness in the next section.) 

Now perhaps it is not initially evident how the problem of incorrigi- 
bility is a problem for BC. Our physicist has shifted his opinion, but 
did not discover any new evidence regarding light quantization, 
whereas BC purports to cover only shifts involving new evidence. Yet 


= Abraham Pais, Subtle Is the Lord (New York: Oxford, 1982), p 337 
* From a 193] letter to R.W. Wood, in A. Hermann, ed., Pruhgeschtchis der Quan- 
tentheone, Claude W Nash, trans. (Cambridge: MIT, 1971), pp. 23-24. 
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we might just as well have imagined an example where a scientist’s cre- 
ative insight-her breaking out of her most prior probability distribu- 
tion—occurred together with her discovery of some new evidence, and 
so her shift of opinion falls within BC’s jurisdiction, but violates BC. In- 
deed, our physicist’s shift of opinion with regard to light quantization 
surely happened to be accompanied by his learning something—perhaps 
that the dog is barking—and so he updated his opinions in violation of 
BC. Thus, I believe that it is permissible for updates to violate BC.” 

We might amend BC by explicitly restricting its application to cases 
where we should not break out from our most prior probability distri 
bution. But since Proe A) = prya(Ale) follows from our te ot 
most prior probability distribution (in particular, from pr,(hle) = 
Prag(hle)), BC is trivialized by being so amended; it then says merely 
that we should update by pr,.,(A) = prag(Ale) in cases where we 
should. Instead, we might amend BC by restricting its application to 
cases where we have no reason to depart from our most prior proba- 
bility distribution, and by offering further rules that govern what 
count as good reasons for so departing.” But BC can then account 
only for those shifts of opinion where the agent should remain within 
her most prior probability distribution. This apparently threatens all 
of the familiar Bayesian accounts of various features of confirmation, 
since these accounts will not apply unless the agent must remain 
within her most prior probability distribution, and until we have de- 
veloped some further rules, we cannot explain when this is. I shall ar- 
gue, however, that although our shifts of opinion need not leave us 
within our most prior probability distribution, the steps in an argu- 
ment justifying our current opinion should do so. BC can then be 
used straightforwardly-without amendment-to explicate ʻe is evi- 
dence for k’, thereby preserving the familiar Bayesian accounts intact. 

0 


Here, then, we have four problems that arise when we try to use BC 
to explicate ‘¢is evidence for A’. I shall suggest that these problems 
fail to arise if our explication depends not on BC’s governing how 
we should update our opinions from one moment to the next, in 
light of new evidence, but instead on BC’s governing how we should 
justify our current opinions by appealing to evidence that we have al- 
ready accumulated. In showing an audience how her current degree 
of confidence in h constitutes a justified response to the evidence 


7 Again, here I concur with van Fraassen, op. cit. 

* Levi (op. ct.) allows confirmational commitments to be revised in various ways 
depending upon the context of inquiry. He offers an account of how we ought to 
revise our “confirmational commitments” in certain Important contexts. 
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that she already has on hand, a scientist explains the bearing of vart 
ous pieces of evidence on A, but does not change her opinions in the 
course of doing so. 

She begins her justificatory argument, I suggest, by appealing to a 
probability distribution that other scientists should recognize as un- 
biased in the relevant respect. She is entitled to begin with this dis- 
tribution as a “free move”—that is, without justifying it." She then 
justifies her current degree of confidence in A by invoking various 
pieces of evidence that have already been discovered, and by using 
them successively to “update” the initial, unbiased opinion until all 
of the relevant evidence has been considered, at which point her 
current degree of confidence in A should be reached. Once again, | 
put scare-quotes around ‘update’ because the scientist is not chang- 
ing her degrees of belief in the course of this exercise. Rather, she is 
Justifying the opinion that she was already holding when she began 
to demonstrate that she was entitled to hold it (In the next section, 
I shall argue that this “updating” must accord with BC.) 

For instance, consider how a weather forecaster might justify her 
current 80% confidence in the hypothesis that it will rain and not 
snow tomorrow. She begins her justificatory argument with an “un- 
biased” probability distribution, which (let us suppose) assigns 20% 
confidence to ‘It will rain and not snow tomorrow’ (hypothesis A,), 
10% confidence to ‘It will rain and snow tomorrow’ (h), 5% conf- 
dence to ‘It will snow and not rain tomorrow’ (A;), and so on. In the 
first step of her argument, she submits her first piece of evidence: 
that today’s barometric pressure was 29 torr. This evidence, she ex- 
plains, warrants “updating” her opinions in a certain way; among her 
“intermediate conclusions” at this first step are pr{A,) = .6, pr(A,) = 
.10, and pr(A;) = .01. Today's barometric pressure, then, is evi- 
dence for A, and against A, while it counts neither for nor against A. 
In the second step of her justificatory argument, she takes into ac- 


* To invoke this distribution’s “impartiality” is obviously to issue a giant 
promissory note. Of course, its mmpartiality cannot depend on its somehow ac- 
cording all conceivable competing hypotheses equal weight. Later, I say a bit 
more about umparnality, but to pursue it much further would amount to dis- 
cussing the problem of the priors, which I shall not do. Let me emphasize that, if 
my position ultimately requires an account of what makes some probabulity distri- 
bution able to serve as the initial distribution, so does the view that BC governs 
the revision of opinion in the face of new evidence. I criticized that view as un- 
able, for various reasons, to explicate ‘eis evidence for A’, but not for presuppos- 
ing an answer to the problem of the priors; the criticisms I reviewed would still 
apply even if we had some account of why certain opinions constitute a reason- 

e place to begin one’s doxastic life My purpose here is to find some way to 
preserve BC's apparent successes while avoiding the problems I have mentioned, 
not to resolve the problem of the priors 
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count another piece of the evidence—perhaps today’s temperature~she 
has on hand. She explains the bearing of this evidence on the lead- 


ing hypotheses; among her intermediate conclusions at this second 
step are pr(A,) = .7, pr(A,) = .002, and pr(A,) = .001. Today’s tem- 
perature is evidence for h, and against h, and A; She continues in 
this fashion until she has submitted all of the evidence she has ac- 
quired and deems relevant. For the justificatory argument to suc- 
ceed, she must at this final step have reached pr(A,) = .8, her 
current opinion. 

Changing your opinion upon ascertaining new evidence is ut- 
terly distinct from justifying your current opinion by appealing to 
evidence that you have already accumulated. The justification of a 
scientist's current opinion is not necessarily the history of her ar- 
rival at that opinion, even if she arrived at that opinion justly. For 
instance, the scientist may at one time have revised her opinion re- 
garding the hypothesis by considering new evidence that she later 
decided was false. In the argument that now justifies her degree 
of confidence in that hypothesis, there is no step in which her 
opinion is “updated” in light of this evidence, since she now be- 
lieves this evidence mistaken. Furthermore, her current justifica 
tion may differ from her earlier justification even if the opinion 
being justified and the evidence justifying it are unchanged. She 
may have changed her mind in the meantime regarding whether 
certain observations count as evidence for or against the hypothe- 
sis, or how powerfully they count. Her current justificatory argu 
ment takes each successive piece of evidence into account in 
precisely the manner that she now believes proper. 

The distinction between updating your opinions upon receiving 
new evidence and showing how the evidence on hand justifies your 
current opinions should not be confused with various other, more 
familiar distinctions, such as that between the “context of discovery” 
and the “context of justification.” Admittedly, both distinctions have 
a diachronic side concerned with arriving at something and a syn- 
chronic side concerned with justifying something. A crucial aspect 
of the traditional distinction between the two contexts, however, is 
that when we discuss some hypothesis in the context of discovery, we 
make no normative judgments; we describe how (for example) a 
conversation, a piece of evidence, a dream, or a heuristic procedure 
led to the formulation of the hypothesis. In contrast, both sides of my 
distinction are normative: how you oughi to revise your past opinions 
in response to new evidence versus how your current evidence war- 
ranis your current opinions. The context of discovery includes vari- 
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ous influences, such as dreams, that appear on neither side of my 
distinction, since they play no justificatory role.” 

One way to grasp the role in justificatory arguments that (I shall ub 
timately suggest) BC plays is through the distinctions that Levi care- 
fully draws among the various roles that BC might play in connection 
with updating opinions. Levi presumes that all rational agents adopt 
“confirmation commitments” that specify, for each deductively closed 
set of claims, the opinions that the agent is committed to holding in 
the event that the claims to whose truth she is committed are exactly 
the members of that set (op. at, p. 79). Levi argues that our confir- 
mational commitments at a given moment are constrained by “confir- 
mational conditionalization”—roughly, that we are committed to 
updating our opinions in accordance with BC (op. at, pp. 81, 222). 
Again very roughly: for a consistent expansion of our current full be- 
liefs, formed by adding e to those beliefs and then rendering the re- 
sult deductively closed, “confirmational conditionalization” requires 
that we be committed to our pry,(Ale) being equal to our pr,(A). Lf 
our confirmational commitments are constrained by confirmational 
conditionalization and remain unchanged upon a consistent expan 
sion of our current full beliefs, then our new opinions are dictated by 
BC. Such a shift of opinion is what Levi calls a “temporal credal com 
ditionalization.” But Levi denies “confirmational tenacity” (thereby 
recognizing what I have called the problem of incorrigibility) (op. at., 
p. 82); that is, he denies that a rational agent must always remain 
faithful to her former confirmational commitments, and so carry out 
their dictates on the occasion of (say) a consistent expansion of her 
beliefs in light of an observation. Rather, she may have good reason 
for revising her confirmational commitments.” In that case, her shift 
of opinion, upon a consistent expansion of her current beliefs, will 
not generally be a temporal credal conditionalization.” Now, rather 
than consider an agent’s shift of opinion from one moment to the 
next, let us consider her transitions from one step to the next in a jus- 
tificatory argument. At each step, she has arrived at certain interme- 
diate conclusions along with (let us suppose) certain “confirmational 


* Some authors have argued that normative judgments can be made regarding 
a scientist's means of thinking up hypotheses to entertain. But even if there isa 
“logic of discovery,” BC could play no role in it, since BC operates only on hy 
potheses that have already been formulated. 

u See footnote 18. 

B Levi notes that, even if the agent retains her confirmational commitment, it 
does not follow that all of her shifts of opinion are temporal credal conditionaliza 
tions, since perhaps a legitimate revision may be occasioned by a contraction 
rather than an expansion of her beliefs, or by new evidence that does not mvolve 
learning the truth of any claim. See footnote 10. 
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commitments” that specify, for any claim e that is consistent with all of 
the full beliefs already arrived at by that step, the opinions that repre- 
sent the right way to take ¢ into account, should e be the further evi 
dence submitted in the next step of the justificatory argument. The 
analog of “confirmational conditionalization” requires that at each 
step, the agent’s confirmational commitments be governed by BC, in 
the following sense: if the agent is required to honor her confirma- 
tional commitments in moving to the next step in the justificatory ar- 
gument (the analog of “confirmational tenacity”), and if every 
transition from one step to the next results from taking account of a 
further piece of evidence e that is consistent with all of the full beliefs 
arrived at thus far in the argument, then the analog of “confirma- 
tional conditionalization” requires that the agent’s intermediate con- 
clusion pr,.(4) at the next step equal her intermediate conclusion 
Plaa(Ale) at the current step. This “update” is the analog of a tempo- 
ral credal conditionalization. In effect, I defend these analogs of com 
firmational conditionalization and tenacity, and argue that all 
transitions from one step to the next in a justificatory argument are 
analogs of temporal credal conditionalizations. 

In other words, I have suggested (in connection with the problem 
of incormgibility) that a scientist is not obliged to revise her opin- 
ions, from ons stage in her work to the next, in accordance with BC. Nev- 
ertheless, I shall argue that at each stage in her work, BC must govern 
the steps in the arguments by which her opinions then are justified. 
In that event, BC would be in a position to account for various fea- 
tures of confirmation without encountering the four problems that I 
have mentioned. 

For instance, the problem of old evidence—as a difficulty for using 
BC to explicate ‘eis evidence for A’-is avoided if this explication ap- 
plies BC not to updatings, but to the transitions between the steps in 
justificatory arguments. Although our confidence in A was never up- 
dated in light of e's discovery, since e was discovered before A was for- 
mulated, the argument by which we now justify our pr(A) appeals to 
e. That argument begins with a probability distribution in which Ais 
assigned some nonzero probability and eis assigned a probability less 
than 1 (since this distribution did not take account of any of the evi 
dence to be submitted in the course of the argument). So in the 
context of a justificatory argument, none of the evidence being pre- 
sented for (or against) Ais “old.” 

Notice that this response to the problem of old evidence is similar 
to but distinct from the appeal to counterfactual degrees of belief 
(which I mentioned earlier). A prior probability in a justificatory ar- 
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gument is like a counterfactual degree of belief in that it may be a 
fictitious state of opinion; perhaps at no moment did our opinions 
match this prior probability distribution. As in the appeal to coun 
terfactual degrees of belief, the strategy is to argue that there is a rea- 
sonable prior that does not construe the old evidence as old. But 
unlike the counterfactual approach, the prior probability distribu- 
tion in a justificatory argument is not beholden to the historical facts 
that determine the confidence that we would have placed in the hy- 
pothesis, had it been formulated sometime earlier. These historical 
facts are irrelevant to our current judgments of whether ¢ is evidence 
for A. A given prior probability distribution is relevant to whether e 
is evidence for h not because, in the actual world or some possible 
world, we hold this probability distribution and then raise our confi- 
dence in k when updating our opinions upon discovering e. Rather, 
it is relevant because this probability distribution figures in the argu- 
ments by which we justify our opinion regarding A. 

It might be objected that I have not removed the problem of old 
evidence, but merely presupposed that there is a solution to it-that 
there is a reasonable prior that does not construe the old evidence as 
already known. But this charge is unfair. Once we shift our attention 
from updates to justificatory arguments, the temporal element is re- 
moved, and so old evidence presents no peculiar problem. The old- 
ness of the old evidence presents no special reason for it to be 
depicted by the initial probability distribution in a justificatory argu- 
ment as having already been ascertained; a piece of evidence submit- 
ted in the course of the justificatory argument, whether or not it was 
discovered before the given hypothesis was formulated, was perforce 
found before the justificatory argument was offered. So there is no 
longer any problem of old evidence. Of course, there remains the 
question: Whence comes the initial probability distribution in the jus- 
tificatory argument? But we have not merely exchanged the problem 
of old evidence for this problem, since this problem was already with 
us as a version of the problem of the priors.” 

The second problem I raised—how to use BC to explicate g's serving 
as evidence for (or against) A, if h is already believed (or rejected) by 
the time sis discovered—is again not a problem once we stop trying to 
find an updating to which BC applies, and instead apply BC to the 
step where gis taken into account in a jfustificatory argument. After we 
adopt (or reject) A, we continue to be committed to arguments that 
justify our doing so. Suppose that even before discovering ¢ we justly 
adopted h; we already had sufficient evidence to fund an argument. 


= See foomote 19. 
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justifying belief in A. Then g's discovery may nevertheless enable us to 
give a new argument for believing A, one step of which involves taking 
einto account. If that step involves an increase in our confidence in h, 
then it is correct to say (in connection with this argument) that we 
take ¢as evidence for h. In short, there is only one history of the shifts 
in Our opinions upon discovering new evidence, but we may have mul- 
tiple arguments for our current degree of confidence in A. So even if 
es discovery resulted in no further shift in our confidence in h, 
nonetheless ¢ may count as evidence for (or against) A, so long as this 
is cashed out in terms of ¢'s role in our justificatory arguments. 

The third problem I raised—that BC cannot explain what makes e ev- 
idence against A, if h is initially believed and then e leads us to recom- 
sider—ikewise can be avoided if BC is applied not to ¢s updating our 
confidence in A, but to ds role in an argument justifying our current 
degree of confidence in h. That is because no claim accepted at one 
step in a justificatory argument is rejected at a subsequent step. For 
instance, if a scientist initially accepts a given claim as an observation 
report but later rejects it, she thenceforth ceases to submit that claim 
as an observation report anywhere in her justificatory arguments. She 
does not cite it at one stage in an argument, as evidence relevant to 
various hypotheses under consideration, and then take it back at a 
subsequent stage. As Thomas Kuhn™ emphasizes, the justificatory ar- 
gument in a scientific paper is “cleaned up”; it does not double-back 
on itself in the way that a scientist’s actual doxastic history might. 

Similarly, the problem of incorrigibility is avoided when BC is ap- 
plied to the steps in a justificatory argument. Our 1905 physicist 
changed her mind about whether the correctness of Planck’s black- 
body equation counts as evidence for Einstein’s photoelectric-effect 
equation. In contrast, once having submitted ¢ at some step in a jus- 
tificatory argument as evidence for (or against) A, we cannot at some 
other step in the argument revise our opinion of whether e counts as 
evidence for or against A. The argument, I shall suggest, operates 
within a fixed most prior probability distribution (though we may, of 
course, later reject the argument). For example, suppose you are 
justifying your current 75% confidence in A. After the first few steps 
in your argument, you have taken into account a certain portion of 
your current evidence and reached pr(A) = .4 as your confidence so 
far. After the next few steps, you have taken into account more of 
your current evidence and reached pr(A) = .8 as your intermediate 
conclusion. At this step, you cannot reject your “earlier” degree of 
confidence as having been the wrong intermediate conclusion to 


™ The Essential Tension (Chicago: University Press, 1977), p. 327. 
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have reached from the evidence under consideration at that “ear- 
lier” step. Since the intermediate conclusions at “earlier” steps in a justifica- 
tory argument are the basis for the intermediate conclusions at “later” steps, a 
“later” step cannot mvolve the reection of an “earlier” intermediate conclu- 
ston as having been an wnappropriate conclusion to have reached from the ev- 
idence under consideration at that step. The opinions appearing as 
intermediate conclusions at a given step of the justificatory argu- 
ment must be regarded throughout the argument as having been mer- 
ited by the evidence taken into account at, or prior to, that step. 

I shall now try to spell out this view by using an argument from cal 
ibration to show that the steps in a justificatory argument must ac- 


cord with BC. 
mi 


A standard way of arguing for BC is by a Dutch Book argument. A 
Dutch Book is a series of bets, each bet deemed fair by the bettor at the 
time she makes it, such that the bettor will lose money no matter what 
the outcomes of the events that are the subjects of these bets. In a syn- 
chronic Dutch Book, the bettor makes all of the bets at once; in a d+ 
achronic Dutch Book, she makes them at different times. The idea is 
that, if the bettor’s degrees of belief regarding the outcomes of various 
events are such that the bookmaker can design bets on these out 
comes which the bettor will take (that is, that she will deem fair) but 
which must necessarily result in the bettor’s suffering a net loss, then 
the bettor must have made a logical error in the management of her 
opinions. It is required that the bookmaker be able to design the 
Dutch Book even though the bettor and bookmaker know exactly the 
same facts. For instance, if the bettor uses a rule for updating her 
opinions upon receipt of evidence, then since she must know the rule 
she uses, the bookmaker must know that rule, too, and if the bettor 
does not know the outcomes of the events on which she has bet, then 
the bookmaker must not know them either. (Obviously, if the book- 
maker knows the outcome of the event on which the bettor is betting, 
and the bettor does not, then it signifies no logical flaw in the bettor’s 
management of her opinions that the bookmaker can construct a bet 
which the bettor believes fair but from which she cannot profit.) It 
was shown (by John G. Kemeny, R. Sherman Lehman, and Abner Shi 
mony, with key contributions from Frank Ramsey and de Finetti) that 
you are vulnerable to a synchronic Dutch Book if and only if your de- 
grees of belief at a given time violate the axioms of the probability cab 
culus, and it was shown (by David Lewis) that you are vulnerable to a 
diachronic Dutch Book if and only if you update your degrees of be- 
lief by using a rule other than BC. 
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But we cannot use a diachronic Dutch Book argument to show 
that the steps of a justificatory argument must conform to BC. We 
might worry that a Dutch Book scenario is not an apt metaphor for 
an agent's offering a justificatory argument, since an agent does not 
receive new evidence and revise her opinions in the course of justify 
ing her current opinions. But even setting this worry aside by treat- 
ing the “updates” in a justificatory argument as like diachronic shifts 
of opinion, there is a further problem: a Dutch Book argument can- 
not show that we must update according to BC, since we can always 
frustrate the bookmaker by using no rule at all to update our opin- 
ions. The Dutch Book scenario presumes that, if we do not know in 
advance how we would revise our opinions under various circum- 
stances, then the bookmaker does not know this either, and so can- 
not construct a Dutch Book against us. Using no rule is not an empty 
threat; we saw our 1905 physicist use no rule in changing his mind 
about whether the success of Planck's black-body equation confirms 
other equations derivable from the light-quantum hypothesis. Ac- 
cordingly, van Fraassen” maintains that, although a Dutch Book ar- 
gument demonstrates that the only rational rule for updating is BC, 
it permits us rationally to depart from BC so long as we employ no 
rule at all for updating our opinions. Why, then, is a justificatory ar- 
gument unsuccessful if its steps depart from BC? 

Let us begin with a logically prior question. Suppose we present a 
justification of our current degree of confidence in a hypothesis, pro- 
ceeding from an “unbiased” most prior distribution by a sequence of 
updates. Each update responds to one.piece of evidence that we 
have accumulated, and the sequence of updates continues until all of 
the evidence on hand that we deem relevant has been considered. 
Why does this exercise possess the power-to justify our current opin- 
ion? Why do we succeed in justifying our current opinion by showing 
that it is guided by the degrees of confidence in the prior step, which 
are guided by the degrees of confidence m the yet prior step, and so 
on back to the “unbiased” most prior probabilities? Why does guid- 
ance by those intermediate conclusions and initial opinions matter? 

It matters, I suggest, because we believe the intermediate conclu- 
sions and initial opinions to be “perfectly calibrated.” Consider a 
weather forecaster who has 80% confidence that it will rain tomorrow. 
Suppose that she bases this opinion on the fact that it rained today, 
the high temperature today was 60 degrees, and so on. Any other day 
where it rains on the previous day, the high temperature on the previ 
ous day is 60 degrees, and so on—in other words, any other day that is 


= Laws and Symmatry, p 174; “Rationahty Does Not Require Conditonalization.” 
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similar to tomorrow, in all of the respects that grounded the forecast 
er’s opinion regarding rain tomorrow-I shall term releventh simlar to 
tomorrow. Suppose that the weather forecaster knows regarding 
some other day only that it rained on the previous day, the high tem- 
perature on the previous day was 60 degrees, and so on. Then the 
weather forecaster should have 80% confidence that it will rain on 
that day, since she has exactly the same reasons in the two cases for be- 
lieving that it will rain. In assigning 80% confidence to the claim that 
it will rain tomorrow, the forecaster is perfectly calibrated if and only if 
it rains on exactly 80% of the days that are relevantly like tomorrow. If 
it rains on more than 80% of those days, then the forecaster’s subjec- 
tive degree of belief is too low; if it rains on less than 80% of those 
days, then her opinion is too high. If she is perfectly calibrated, then 
her opinion is nght; it is just what it ought to be, considering her rea 
sons for it (that is, her sense of relevant similarity) .™ 

In a justificatory argument, we base our intermediate conclusion 
‘pr(4) = 7 on certain considerations that we have already submitted 
by that step. Suppose that we consider some other hypothesis k, dis- 
cussed at that step, to be relevantly similar to A, in that we take the 
considerations on which we base our intermediate conclusion ‘pr(f) 
= s at that step to be relevantly similar to those on which we base 
‘pr(A) = r at that step. That is, we take ourselves to have exactly the 
same kinds of reasons for our degree of confidence in 4 as we do for 
our degree of confidence in A. (Accordingly, r= s) The intermedi 
ate conclusions at a given step are “perfectly calibrated” exactly when 
for each of these conclusions ‘pr(h) = 1’, ris the fraction of true hy- 
potheses among the hypotheses that at this step we deem relevantly 
similar to h. I suggest (as a first approximation) that we believe the 
intermediate conclusions at a given step of a justificatory argument 
to be perfectly calibrated, since this would explain why we believe 
that we ought to be guided by those conclusions when we move to 
the next step, taking some further evidence into account. (In a mo- 
ment, I shall examine just how we should be so guided.) 

Let me elaborate. Suppose we have no confidence in the calibration 
of a weather forecaster’s predictions. For example, she has 80% confi- 
dence that it will rain tomorrow, but our opinions regarding the frac- 


~ For a similar conception of an opinion’s rightness, see van Fraassen, “Calibra- 
ton: A T for Personal Probability,” in RS. Cohen and L. Lau- 
dan, eds., Physics, Philosophy, and Prychoanabsis (Boston: Reidel, 1988), pp. 295-319, 
here p. 301; Lams and Symmetry, p. 177. I shall qualify this idea shortly. As van 
Fraassen emphamzes, this conception of an opinion’s nghtness does not suggest 
that our subjective degree of confidence in rain tomorrow is actually to be under- 
stood as a claim concerning the objective chance of rain then. 
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tion of rainy days among the days that she takes to be relevantly like to- 
morrow are no different from our opinions regarding the fraction of 
rainy days among the days that she takes to be relevantly like the day af- 
ter tomorrow, even though she has 80% confidence that it will not rain 
on that day. Then we do not take her forecasts seriously; we give them 
no weight in guiding our opmion regarding whether it will rain.” Ina 
justificatory argument for our current opinion, by contrast, each step’s 
intermediate conclusions are guided by the previous step’s; we take 
those intermediate conclusions seriously when we take some further ev- 
idence into account. If we change our minds about the right interme- 
diate conclusions to draw at a given step, then we change that step, 
adjusting the conclusions there accordingly. We do not retain the step 
with the intermediate conclusions that (we now judge) fail to accord 
certain considerations their proper weight, and add to the argument a 
further step to negate the effect of these conclusions on subsequent 
steps. To include a step where no additional evidence is submitted, but 
new intermediate conclusions are drawn to replace the old, would be 
in effect to start the argument over in the middle by setting aside the 
intermediate conclusions drawn in the earlier step as having been mis- 
taken there. (I present an example of this in a moment.) The simplest 
rational reconstruction of why the intermediate conclusions in a justif- 
catory argument are taken seriously throughout the argument, each 
step’s guiding the next step’s, is that the agent believes each step’s in- 
termediate conclusions to be perfectly calibrated.” 

In contrast, we can rationally change our opinions from one moment 
to the next without ascertaining any new evidence. When we decide 
that our earlier opinions failed to accord certain considerations their 
proper weight, we decide that those opinions were not the right ones 
for us to have held, considering the evidence we then knew. We are 
taking into account no evidence (or even logical truth) that we for- 
merly failed to take into account; rather, we are changing our minds 
about what it takes for us to take proper account of some considera- 
tion that we already knew. 

So the key difference between updatings from one moment to the 
next and “updatings” from one step in a justificatory argument to the 


7 For a similar view, see Maunce Fréchet, “Les Définitons Courantes de la Prob- 
abilité,” Reows Phalasophzqus, CXXXVI (1946): 129-69, here p. 144. 

= Here is a less simple but perhaps more realistic alternative. Rather than re- 
quire that the agent believe rto be the fraction of true hypotheses among the hy 
potheses relevantly like A, require that r be the agent's expectation valus for that 
fraction. That is, require that r= Epr(r)r, where pr(r) is the agent's subjective 
probability that r, is that fraction. This modification is compatible with the subse- 
quent argument, seulahs mutandis. 
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next is that in the justificatory case, but not in the diachronic case, we 
must use the earlier opinions to guide their revision; we deem the later 
opinions to be right on the basis of deeming the earlier opinions to have 
been right. If we decide that some intermediate conclusion was not 
right, then we change that step, giving ourselves a new Justificatory argu- 
ment. But ifwe change our minds about whether our opinions at an ear- 
lier moment accorded proper weight to the evidence then known, we 
cannot go back and change which opinions we held in the past. 

To see how the later steps of a justificatory argument are sup- 
posed to be guided by the earlier steps, so that the earlier steps 
must be considered throughout the argument to have been right, 
consider this example. Suppose that a weather forecaster sets out 
to justify her 80% confidence in rain tomorrow. She begins her ar- 
gument with an “unbiased” most prior probability distribution. 
She then considers a piece of evidence: that today’s barometric 
pressure is 29 torr. She assigns tomorrow to a reference class: the 
set of days each of which is preceded by a day with barometric pres- 
sure close to 29 torr. She judges the cases belonging to the same 
reference class to be “relevantly similar,” that is, similar in all of the 
respects on which she bases her intermediate conclusion at this 
step regarding whether it will rain tomorrow. Accordingly, in the 
intermediate conclusions at that step, she assigns the same degree 
of confidence to rain on each of these days—let us say 50%. She 
then considers another piece of evidence: whether or not it rained 
today. She assigns tomorrow to a narrower reference class (taking 
a different set of cases to be relevantly similar) and accordingly 
adopts a new intermediate conclusion. Now suppose that, if her 
second piece of evidence is that it rained today, the weather fore- 
caster is prepared to make her second step’s intermediate conclu- 
sion ‘pr(rain tomorrow) = .2’, and if her second piece of evidence 
is that it did not rain today, the weather forecaster is prepared to 
make her second step’s intermediate conclusion again ‘pr(rain to- 
morrow) = .2'. This would be tantamount to her changing her 
confidence in rain tomorrow from one step to the next without any 
additional evidence as motivation; plainly, she is not motivated by 
taking into account that it did rain, as the opposite discovery would 
have made no difference to her second step’s intermediate conclu- 
sion, and she is not motivated by taking into account that it either 
rained yesterday or it did not, as this is a trivial truth and so does 
not count as new evidence. To go in this way from 50% to 20% 
confidence in rain tomorrow is to fail to take the first step’s inter- 
mediate conclusion seriously; she is not being guided by her first 
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step’s intermediate conclusions in reaching her second step’s. 
That is not the way a justificatory argument proceeds. 

The same flaw occurs even if the two possible intermediate conclu- 
sions (‘pr(rain tomorrow) =...') in the second step are not identical, 
if they are both greater than .5 (the first step’s intermediate conclu 
sion) or both less than .5. In either case, the agent is not taking the 
first step’s intermediate conclusion seriously. For instance, if both of 
the second step’s possible intermediate conclusions assign less than 
50% confidence to rain tomorrow, then she cannot regard it as being 
right to assign 50% confidence in the first step; she was prepared to 
lower her pr(rain tomorrow) in the second step no matter what the ev- 
idence concerning rain today, so she must regard the first step’s 50% 
assignment as excessively high. This logical flaw is manifested in the 
fact that the agent cannot with consistency regard her first step’s inter- 
mediate conclusions as perfectly calibrated and her second step’s in- 
termediate conclusions as perfectly calibrated. For if rain occurs in 
less than 50% of the days following 29-torr rainy days, and occurs in 
less than 50% of the days following 29-torr nonrainy days, then it cam 
not be that rain occurs in 50% of the days following 29torr days.” In 
failing to take her intermediate conclusions seriously, the weather 
forecaster betrays her belief that they are not calibrated. 

Let me elaborate the notion of a “reference class” as it pertains to calt 
bration. For me to believe my intermediate conclusion ‘pr(A4) = r’ to be 
perfectly calibrated is for me to believe that ris the fraction of truths 
among the hypotheses that in this step I believe “relevantly similar” to A 
(and so to which I assign the same degree of confidence as I do to A). 
Thus, my belief in the perfect calibration of my intermediate conclusion 
is meaningful only if there is a meaningful sense of hypotheses “rele 
vantly like” A. Readers may now recall the notorious problem of the reference 
class (or problem of the single case) as it afflicts various frequentist analyses of 
probability: If ‘the probability of this coin toss landing heads is 50%’ just 
means that 50% of a large collection of tosses like this one would land 
heads, what count as tosses Ake this one? Despair over the reference-class 
problem has often been cited as an argument against any such analysis of 
probability. But Iam not offering an analysis or interpretation of the prob- 


ability assignments in justificatory arguments. The opinions that we be- 
lieve perfectly calibrated are not opinions about the fraction of truths 


» My flawed weather forecaster is intentionally similar to van Fraassen’s Piero in 
“Belief and the Problem of Ulysses and the Sirens,” Philosophical Studies, LXXVO 
(1995): 7-37, here p. 15 His example involves a change in opinion from one mo- 
ment to the next rather than from one step to the next m a fustificatory argument. 
What van Fraassen calls the forecaster’s lack of “integrity” I elaborate as her failure 
to take seriously her first step's intermediate conclusion. 
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among relevantly similar hypotheses; they are subjective probabilities. 
Moreover, the notion of calibration that I am employing is not afflicted 
by any problem of the reference class. The hypotheses that I take in this 
step to be “relevantly similar” to Aare exactly the hypotheses that are sim- 
ilar to A in all of the respects on which I base my intermediate conclu- 
sions at this step. In other words, they are the hypotheses that I take 
myself at this step as having the same sorts of reasons for believing as I do 
for believing A. For the hypothesis of rain tomorrow, it is easy to imagine 
what it might take at a given step for a hypothesis to be relevantly similar. 
For another sort of hypothesis, it may be more difficult to characterize 
the reasons we have for assigning to it the degree of confidence we do.” 
But to make my point, I do not need to offer an account of these rea 
sons. I need presume only that we must have reasons for assigning to hy 
potheses the degrees of confidence we do (even if we have not explicitly 
identified these reasons). Unlike the frequentist interpretations of prob- 
ability I mentioned, talk of perfect calibration does not presuppose some 
objectrve sense of two hypotheses being “relevantly similar.” The charac- 
teristics by which we are guided in confirming the hypothesis determine its 
reference class. We take ourselves as being guided by some factors or 
other (which, once again, we may be able to express in only a rough and 
ready fashion). I return to this point in the next section.” 


* Notice that I need not regard every hypothesis to which (at this step) I asngn the 
same degree of confidence as I assign kto be “relevantly similar” to A. (van Fraassen 
makes the same point in “Calibration: A Frequency Justification,” p. 303; see also the 
next footnote.) I may consider my reasons for assigning rto a hypothess predichng 
ram to be quite different from my reasons for assigning r to a hypothesis predicting 
that a hearttransplant patient will survive for five years after his transplant. 

= van Fraassen offers one way to cash out this notion of the reference classes in terms 
of which our calibration ts assessed. In “Calibration: A Frequency Approach,” he says 
(pp. 302-05) that the reference class for the cham ‘x s A’ (perhaps ‘22 May 1998 will be 
rainy in Seattle’) consists roughly of ‘y is A; ‘zis A’, and so on, where for every property 
B, our pr(x i B) = pr(y a B) = pr(z ss B), and 3o on By restricting the reference class 
of ‘x as A’ to to claims attnbuting A to other entities, van Fraassen prevents the reference 
Class for ‘x és A’ from consisting of ‘x s A’ along with arbitrarily many claims of the form 
"The ath toss of the die lands ace’. This is mportant for the following reason: 

[Sluppose that I first state my probabilty for rain as 1/6 and then you ask me 
about one thousand tosses of a fair die for the probability of ace and I say 1/6 
each ome. On the total set of 1001 questions, my personal probability will prob- 
ably be quite well calibrated, but that reveals nothing about the reasonableness 
of my initial judgment about rain To see the problem in acute form, let this 
first judgment be replaced by two: adding to it also the judgment that the proba- 
bility of there being no rain equals 1/6 as well. Calibration on the total of 1002 
propositions will be quite good, whereas there is something drastically wrong 
with my probabilities for the first two. 

So the possibility of ever better calibration which we require, must be on ex- 
tensions of the initial set of propositions which are in a relevant sense hke the 
original ones (pp. 304-05). 

van Fraassen’s thought seems to be that a rational agent must have the same res- 
sons for her opinion regarding 7 as for her opinion regarding ‘x u A’ if 7 attributes 
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I have suggested that an argument deriving our current degree of 
confidence in a hypothesis from an “unbiased” most prior distribution, 
by a sequence of updates that eventually takes all of our evidence into 
account, possesses the power to justify our current opinion because we 
are entitled to believe that this argument’s intermediate conclusions are 
perfectly calibrated. We are entitled—without having to submit any argu- 
ment-to believe the most prior distribution to be perfectly calibrated; 
that this is a free move is part of what it is for the most prior distribution 


A to some other entity y and tf she has the same confidence in ‘y 15 A’ asin 'x is A’, 
the same confidence in ‘y1s B’ as in ‘x 1s B’, and so on van Fraassen refuses to cash 
out the reference class by appealing directly to our having the same reasons for our 
confidence in 7 as for our confidence in ‘x zs A’; he is willing to appeal only to fea- 
tures of our probability distribution. 

But this self-imposed limitation leads van Fraassen to resort to the heavy-handed 
measure of requiring the reference class for ‘x ts A’ to consist exclusively of clams 
attribuung A to other entities. It is not at all clear to me how this analysis applies 
when the claim in question is not of the monadic form ‘x 1s A’-for example, when 
the claim is quantfied. How do we assess whether a nineteenth century physicist’s 
opinion of Newton's gravitationalforce law is calibrated? 

I am not as squeamish as van Fraassen 1s about carving out h's reference class by 
appealing directly to our reasons for holding our opinion regarding A. | have no 
way to read these reasons off of our probability distribution Analyzing what it 
means to say that our opinion regarding A is based on a given consideration is a 
classic epistemological problem. See, for instance, Keith Lehrer, Knowledges (New 
York: Oxford, 1974), and George Pappas, “Basing Relations,” ın Pappas, ed., Austr/i- 
cation and Knowledge (Boston: Reidel, 1979), pp. 51-63. I assume nothing about 
what our reasons are, only that we must have reasons. (See section Iv.) They pre- 
sumably relate to our probability distribution ın something like the depts, able 
Our reasons for assigning a certain degree of confidence r to h consist of our belief 
that A possesses various properties a, f,..., where these properties could logically 
possibly hold of any number of (logically inequrvalent) clai Let I’ be a dummy 
variable standing for a claim. Then consider our pr(I iT possesses a, §,..). In 
other words, for an unidentfied cham, this is our degree of confidence in it, given 
various facts about it. Soha mehi be tech E sayi iat it wilting oa acera 
day, this day follows a day where the high temperature was 80 degrees and the 
barometric pressure was 30 torr,...). Our belief that 4 possesses a, 8, can be our 
reason for assigning r to A only if our Pa I possesses a, f,...) = r, and for any 
further claim g that we believe, our pr(T ig, T possesses a, f,...) =r. (For some 
general emardi in a similar spirit, see Wesley Salmon’s discussion of the kinds of 
properties the possession of which by a hypothesis can serve as a basis for a plaus- 
bility ju ent-in The Foundations of Scenitfic Inference (Pittsburgh: University 
Press, 1967), pp. 124-26.) 

The above constraint on A’s reference class ensures, for instance, that for any kin 
that clam, pr(k) = pr(A). This is also a feature of van Fraassen’s account But this 
constraint does not preclude the reference class of '22 May 1998 will be rainy in Seat- 
tle’ from conusting of arbitranly many clans of the form “The ath toes of the die 
lands ace’. I have presented no logwal bar to someone’s having the following reason 
for asugning a degree of confidence of 1/6 to 7: either j concerns rain in Seattle on 
22 May 1998 and the high temperature on TE eiee 80 degrees,..., ory 
concerns a fair die landing ace. We presumably different reasons for our opin- 
ion regarding the rain claim, however, than for our confidence in the die claim. (To 
say just what makes this so, we would need to identify the “basing relation.”) 
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to be “unbiased.”™ Each step of the justificatory argument takes ac- 
count of another piece of evidence by revising the probability distribu- 
tion from the prior step so that it is logically possible for the prior step’s 
intermediate conclusions and the new step’s intermediate conclusions 
both to be perfectly calibrated. The agent sees herself as in each step 
taking account of some additional piece of evidence so as to preserve in 
her new intermediate conclusions the perfect calibration that began 
with her most prior probability distribution. She justifies her current 
opinion by showing herself entitled to take her current opinion to be 
pertectly calibrated—by showing how her current opinion results from 
calibration-preserving updates going back ultimately to her most prior 
opinions, whose calibration she is entitled to take as a working assump- 
tion.™ I think that this picture explains why such an argument possesses 
the power to justify our current opinion regarding a hypothesis. For if 
we have shown ourselves entitled to believe a probability distribution to 
be perfectly calibrated, then (I argued) we have shown ourselves enti- 
tied to believe those degrees of confidence to be night, and so we have 
shown ourselves entitled to hold those degrees of confidence. When 
our 1905 physicist decided that he had previously failed to assign certain 
hypotheses the degrees of confidence they had merited considering the 
success of Planck's black-body equation, he was deciding that those ear- 
lier opinions had not been perfectly calibrated. 

Note that I have argued neither that our current opinions nor that the 
intermediate conclusions in a justificatory argument must be perfectly cal- 
ibrated in order for them to be justified. I have argued only that we must 
justly bekeve our current opinions to be perfectly calibrated in order for us 
to be justified in holding them, and that we justify so believing by show- 
ing how those opinions result from calibration-preserving updates, ulti- 
mately of a prior distribution that we are automatically entitled to take as 
calibrated. For us to be justified in believing certain opinions to be cali- 
brated, it is not necessary that they actually be calibrated, but (given logi- 
cal omniscience) their calibration must be logically possible. 

Perfect calibration is nearly but not quite the sense of rightness appro- 
priate for the intermediate conclusions at a given step of a justificatory ar- 
gument. Suppose that at a step where ‘pr(h) = .8’ is an intermediate 
conclusion, we discuss only one other hypothesis that we deem to be rele- 
vantly similar to A. It is logically impossible for exactly 80% of these two 
hypotheses to be true (since 80% of 2 is not an integer). So our interme- 


™ For any opinions consistent with our evidence, there are surely some prior prob- 
abilities that would yield these opinions via Bayesian updating on our evidence. But 
there is no guarantee that these prior probabilines would be “unbiased ” 

= In my examples, the agent shows this by presenting a sapuence of updates, 
though she could in principle do so by a sengle update on all of her evidence 
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diate conclusions at this step cannot be ( justly believed to be) perfectly 
calibrated. Clearly, this is not from any shortcoming in those opinions; it 
is merely a technicality. To elaborate the “rightness” of these intermedi 
ate conclusions, we need what van Fraassen“ calls “potential perfect cali 
bration.” Consider again the weather forecaster. Each day she assigns 
the next day to a certain reference class based on the current barometric 
pressure, temperature, and so forth. She judges the cases belonging to 
the same reference class to be “relevantly similar,” that is, similar in the 
respects on which she bases her opinion regarding whether it will rain 
the next day. Accordingly, she assigns the same likelihood r of rain the 
next day to each of the co-classified cases. Now take the actual cases 
(days) belonging to a given reference class, some fraction fof which are 
rainy days. Consider what would happen to fwere hypothetical further 
cases, relevantly like the actual cases, to be added. Consider what the 
limiting relative frequency would then be of rainy days among days 
where the forecaster’s meteorological data fall within the relevant range. 
The forecaster’s degrees of confidence are “potentially perfectly cali 
brated” exactly when for each reference class, fwould approach arbitrar- 
ily close to r were the number of cases in the reference class to increase 
without bound. In an analogous way, we can specify what it is for the in- 
termediate conclusions at a given step of a justificatory argument to be 
“potentially perfectly calibrated.” They are potentially perfectly cali 
brated if and only if for any hypothesis k and number r, if ‘pr(A) = r is 
an intermediate conclusion at that step, then among hypotheses deemed 
at that step relevantly like A and accorded a degree of confidence (also 7) 
at that step, the fraction of truths would approach arbitrarily close to r 
were there sufficiently many such hypotheses.” 

As van Fraassen™ explains, it is logically possible for one’s opinions at a 
given time to be potentially perfectly calibrated only if those opinions sat- 
isfy the probability calculus. To glimpse the idea behind his argument, 
notice that, if we assign greater than 100% confidence to A, then it is logi 


H “Calibration: A Frequency Justification,” pp. 304-05; “Belief and the Will,” this 
JOURNAL, LXXXI, 5 (May 1984): 235-56, here p. 245. 
s Pe diee s concept of the limiting relative frequency of truths among rele- 
theses raises a host of familtar worries (such as how hypotheses 
n rip iduated, how the counterfactual ‘Had there been more relevantly 
simular cases’ is to be understood, whether the limiting relative frequency depends 
on the order in which the relevantly similar cases are considered, and so on)—fa- 
miliar since analogous worries arise for certain frequentist views of objectrve proba- 
bility. I shall not investigate here whether these worries apply to the use I wish to 
make of potential perfect calibration, or how they might be addressed. 
™ “Calibration: A Frequency Approach”; “Belief and the Will,” pp. 24546 See 
also Shimony, “An Adamite Derivation of the Calculus of Probability,” in J.H Fet- 
rer, ed., Probabihty and Causality (Dordrecht: Reidel, 1988), pp. 151-61. 
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cally impossible, no matter how many hypotheses we are asked about 
that we deem relevantly like A, for more than 100% of these hypotheses 
to be true. Likewise, if we assign 60% confidence to h and 60% confi- 
dence to —A, then again, it is logically impossible, no matter how many 
hypotheses we are asked about that we deem relevantly like h (and so 
whose negations we deem relevantly like —A), for the fraction of truths 
among the hypotheses like A to approach 60% and the fraction of truths 
among the hypotheses like =A to approach 60%, since that outcome 
would require that certain hypotheses and their negations both be true.” 
I shall apply van Fraassen’s argument to our opinions at a given step 
in a justificatory argument rather than to our opinions at a given mo- 
ment. The argument then shows that it is logically possible for the in- 
termediate conclusions at a given step of a justificatory argument to be 
potentially perfectly calibrated only if they satisfy the probability calcu- 
lus. This constitutes a synchronic calibration argument. I shall now 
give a diachronic argument—again, not in the sense of applying to a 
shift in opinion from one moment to the next, but to a shift in opinion 
from one step in a justificatory argument to the next, as an additional 
piece of evidence is taken into account. Suppose pr(-) is our probabil- 
ity distribution at a given step in a justificatory argument, and suppose 
pr (-) is our probability distribution at the next step in that argument, 
where ¢ is the additional evidence that is brought to bear. Suppose 
pr(-) is potentially perfectly calibrated. I now show that it is logically 
impossible for pr’(-) and pr( ) both to be potentially perfectly cali 
brated unless pr’ (-) results from updating pr(-) according to BC. 


P van Fraamen guarantees that the negation of a claim in A's reference class is in 
—A's reference class: the reference chess for ‘x ts A’ consists of claims of the form ‘...2 
~Á’, and if y is such that ‘y is A’ 1s in the reference class for ‘x ıs A’, then for every 
property B, our pr(x s B) = pr(y is B), and so the reference class for ‘x 1s 4A’ includes 
‘y1s A’. Does the scheme I sketched in footnote 31 require this sort of coordination 
between the reference chasses for A and ) where 7 = —A? It presumably would, once 
some account was added of what it means for an opinion to be “based” on a given 
consideration. But this coordination would be required only when we know all of the 
relevant logical truths, smce only then could our reasons for our confidence in j be 
our reasons for our confidence in A along with the fact that ) = —A, (In other words, 1f 
‘T possesses cc’ distinguishes A’s reference class, then T possesses 6’ can distinguish 
Ja, where p is the property of being logically equivalent to the negation of a dam pos- 
“inthe I shall shortly discuss this point further in the main text. 

ee ieee ee ee 

ommniscience is presumed (in order for various claims’ reference chasses 

ates “coordinated” as above). This seems reasonable, since in the absence of logical 

omnuscience, it is unclear that coherence is required. I assume logical omnisclence 

out. But it is interesting to note that on my conception of the reference chisses, 

an agent coukd be calibrated even if she is ignorant of some relevant logical truths. She 

might, for mstance, assign various numbertheoretk: hypotheses to various reference 

classes, depending on the reasons behind the degrees of confidence in (0,1) that she 
currently assigns them. There room here to treat the confirmation of logical truths. 
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Suppose that pr(-) is potentially perfectly calibrated, so pr(h) 
equals the limiting relative frequency of truths among the hypothe- 
ses like A in the respects relevant to us at the given step. Roughly 
put, pr(h) is the value approached by the fraction of truths in a large 
set S of hypotheses each of which is like Ain the respects relevant to 
us at this step. For pr’(-) to be potentially perfectly calibrated, 
pr’ (A) must equal the value approached by the fraction of truths in a 
large set of hypotheses each of which is like hin the respects relevant 
to us at the next step in the justificatory argument. But these re- 
spects are exactly the respects that were relevant to us at the previous 
step plus which e holds; the hypotheses like hin the respects relevant 
to us at step (n+1) are exactly the hypotheses for which evidence 
like ¢ holds and which are like A in the respects relevant to us at step 
(n). So for pr’(-) to be calibrated, pr’(A) must equal the value ap- 
proached by the fraction of truths among those hypotheses in $ for 
which evidence like e holds. Since pr(-) is potentially perfectly cali- 
brated, pr(A & e) is the fraction, among hypotheses in S, of truths for 
which evidence like e holds, while pr(e) is the fraction, among hy- 
potheses in $, of hypotheses for which evidence like e holds. That is, 


# of hypotheses in S that are true 
'(h) = and for which there is evidence like e 
pr # of hypotheses in S for which there is evidence like ¢ 
# of hypotheses in Sfor which there is evidence like e 
Dl) = E T OE E 
# of hypotheses in $ 
# of hypotheses in S that are true 
and for which there is evidence like ¢ 
# of hypotheses in S 


in the limit of an arbitrarily large S. Hence, pr(A & ¢)/pr(e) = pr’(A). 
But by definition, pr(h & e)/pr(é) = pr(hlà. Therefore, pr’ (A) = pr(Ale). 
Thus it is shown that a justificatory argument must proceed by BC. 

This reasoning nearly convinces me. The main source of my hesita- 
tion is that this argument depends on ¢'s reference class “meshing” with 
h's. That is, since pr(-) is potentially perfectly calibrated, pr(e¢) is the lim 
iting relative frequency of truths among claims relevantly like e. Is part of 
what makes a claim relevantly like e that there is a corresponding hypoth- 
esis relevantly like A? The above argument presumes so, since it takes 
pr(e) to be the fraction, among hypotheses in S, of hypotheses for which 
there is evidence like e But perhaps this presupposition is implausible. 


pr(h &e) = 
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Is some claim made relevantly like ¢ partly by its relation to a hypothesis 
like 4 in the same way as two days are rendered relevantly similar, for the 
weather forecaster, partly by their each following days whose barometric 
readings are relevantly alike? Could a claim be relevantly like ¢ but corre- 
spond to no hypothesis like A, and even if it did correspond to such a hy 
pothesis, could this fact play no part in fixing its reference class? 

But perhaps this “meshing” of h’s and e's reference classes is exactly 
right. In the above argument, A's reference class in the step where eis 
taken into account seems clearly to consist of those hypotheses in S for 
which evidence like e holds. Just as the previous day’s barometric read- 
ing helps to fix the_reference class to which a given day belongs, so in 
the later step of the justificatory argument, A’s being associated with evi- 
dence like e helps to fix h’s reference class. As gis bound up in A’s ref 
erence class, why should A not be bound up in gs? Just as the weather 
forecaster determines the likelihood of rain on a given day by assigning 
that day to a reference class of relevantly similar days that she has expe- 
rienced, so the scientist might determine the likelihood of making ob- 
servation ¢ by noting that ¢ is strongly suggested by hypothesis A and 
thereby assigning ¢ to a reference class of claims that are all strongly 
suggested by hypotheses like A. Notice that the same coordination of 
reference classes was required by the argument given earlier that, if we 
assign 60% confidence to k and 60% confidence to —A, then it is not 
logically possible for us to be potentially perfectly calibrated. That ar- 
gument required that the hypotheses relevantly similar to ~A be the 
negations of the hypotheses relevantly similar to A. This seems correct. 

If the reference classes must mesh in this way, then I have shown 
that the steps in a justificatory argument must conform to BC. In this 
role, I have argued, BC can be plugged into the familiar Bayesian pro- 
posals for explicating various features of confirmation in science. No- 
tice, again, that no analogous argument can be made that scientists 
must accord with BC in updating their opinions from one moment to 
the next, upon receiving new evidence. At any moment, I have ar- 
gued, a scientist must believe her current opinions (and the intermedi 
ate conclusions in the arguments by which she justifies them) to be 
potentially perfectly calibrated, but she may believe that her opinions 
at other moments fail(ed) to be potentially perfectly calibrated. She may 
change her mind-at one time believing, at another moment denying 
that certain opinions match the objective frequencies.” 

” Sce she need not believe her past opinions to have been calibrated, the connec- 
non I have demonstrated between callbrahon and BC does not show that, upon receiv- 
ing new evidence, she must drachronically update her opinions m accordance with BC. 


It might be argued that an agent must not only deheve her opmions to be calibrated, but 
for those opinions genuinely to be justified, they must de calibrated. It would follow that 
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IV 

As I noted, a diachronic Dutch Book argument purports to show that 
BC is the only acceptable rule for updating opinion. It thus leaves 
room for van Fraassen’s position: that we can violate BC without vulner- 
ability to a Dutch Book if we use no rule for updating our opinions. 
But the above calibration argument leaves no room for this position. It 
shows that the intermediate conclusions in a justificatory argument can 
logically possibly all be potentially perfectly calibrated onb tf these conclu- 
sions succeed each other according to BC. No reference is made to whether 
some rule is consulted in proceeding from one step to the next. 

It might be objected that my argument does presuppose that some rule 
is involved—namely, a rule fixing which hypotheses are (by the agent’s 
own lights) relevantly similar (at a given step) to A. If the agent endorses 
no such rule, that is, recognizes no sense of hypotheses “relevantly simi 
lar” to A, then (the objection runs) there is no possibility of her reference 
classes being such that her opinions cannot logically possibly be poten- 
tially perfectly calibrated—because she has no reference classes. 

But my suggestion was not that, if the agent recognizes a sense of 
“relevant similarity” to A, then she must believe that her intermediate 
conclusion regarding A could logically possibly be potentially perfectly 
calibrated, and so the transitions between the steps of a justificatory ar- 
gument must accord with BC. Rather, I suggested that a rational 
agent must recognize a sense of “relevant similarity” to h. That is be- 
cause she must believe that her opinion of h is potentially perfectly cal 
ibrated, since this is to believe it to be nght (considering the evidence 
brought to bear by that step). She must believe it to be right since she 
believes that it should guide her in taking account of some further evi 
dence when moving to the next step in the justificatory argument. 

It is worth noting another difference between the role in the Dutch 


Book argument played by a rule for updating and the role in my ar- 


when an agent holds justified prior opinions and then, upon acquiring new evidence, 
adopts justified new opmions, this updating must conform to BC. But my argument 
that an agent must believe her opinions at a given step of a justification to be calibrated 
was, in brief, that otherwise she would have no reason to base her opinions at the next 
step on the opinions at the previous step. This argument does not somehow generalze 
to show that an agent’s opinions must really be calibrated. The opinions at one step of 
a justificatory argument are based on the opinions at the ious step, but the same 
does not hold diachronically, an agent need not be guided by her old opinions in form- 
new ones ‘That is why I am willing to require that an agent believe her opinions to be 
but not to require that agent’s opinions really be calibrated. 


that fustifled opinions be calibrated would remain crucially subjective, considering that 
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gument played by a rule determining which hypotheses are relevantly 
similar (at a given step) to A. The Dutch Book defense of BC requires 
that there be some particular rule for updating that the agent has ex- 
plicitly adopted (so the bookmaker also knows in advance how the 
agent would update, and can design the book accordingly). In com 
trast, my argument does not presume the agent to have explicitly 
committed herself to some particular rule for determining which hy 
potheses are relevantly similar (at a given step) to A. Furthermore, 
the Dutch Book argument presumes that the agent has a disposition 
to update her opinions in a certain way if she receives certain evi 
dence (and that the agent knows what her disposition is, since the 
bookmaker must know). In contrast, the calibration argument does 
not presume that the agent currently has some disposition to believe, 
under certain circumstances, that a given hypothesis is relevantly sim- 
ilar to kh. The agent may have no such disposition because she does 
not recognize much about the sense of relevant similarity upon which 
she is relying. That is, she may not have identified precisely what her 
reasons are for her opinion (at a given step) regarding A, so she may 
have no disposition to believe that her reasons in another case, for 
the opinion she would hold there regarding some hypothesis, would 
be the same as her reasons (at this step) for her opinion regarding 
h.” The calibration argument presumes only that the agent Aas such 
reasons—that her opinions are grounded in some sense of “relevant 
similarity.” This suffices, I have argued, to require that her justifica- 
tory arguments conform to BC. 

van Fraassen defends a principle he calls reflechon: that at any mo- 
ment 4 xis your degree of confidence that A is true in the event that 
at tor some time thereafter, x is your degree of confidence in A. 
(That is, reflection says that for t S t, pr, (Alpr,(h)=x) = x.) 
Roughly speaking, reflection expresses an agent’s belief that now and 
in the future, her opinions are calibrated.” van Fraassen“ has shown 


* To have reasons, it is not necessary that you be in a position to articulate them, 
or even to say whether your reasons in one case are the same as your reasons in an- 
other. To do that may require careful philosophical thought. (For more on what 
it takes for an opinion to be based on a given consideration, see Lehrer (op. at) 
and Pappas (op at.) and the references therein.) The agent might acquire a dispo- 
sition to believe that (at a given step) she has the same reasons for her opinions re- 
garding two hypotheses. Such a disposinon might be groomed by asking the agent 
to address questions like ‘Why do you have a different degree of confidence in 
Brown's five-year survival after a heart ere than Smith's, considenng that 
they are similar in all of these s 

” For a careful discussion Seale er eee eee ee ee and 
Christopher Hitchcock, “Reflections on Reflecnon,” Synthese, xcvi (1994): 297-324. 

“ “Belief and the Will.” 
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that an agent who updates her opinions from one moment to the 
next according to BC must conform to reflection; Phillip Dawid has 
shown likewise that a coherent Bayesian forecaster must assign proba- 
bility one to the event that she is and will be potentially perfectly cali- 
brated. van Fraassen and Dawid then ask how such confidence in 
one’s own current and future calibration could be warranted. 

What I showed in the previous section is not van Fraassen’s or Dawid’s 
result, but more like their converse. I showed that an agent who is poten- 
tially perfectly calibrated before and after an update must have updated 
in accordance with BC. This result is not quite the converse of van 
Fraassen’s or Dawid’s because it concerns whether an agent ts potentially 
perfectly calibrated, not whether she bebeves herself so to be. Of course, I 
have also argued that an agent must believe in the potential perfect cali 
bration of the intermediate conclusions at each step of her justificatory 
arguments. In particular, then, she must believe that the opinions at the 
final step of her justificatory argument (that is, her current degrees of be- 
lief) are potentially perfectly calibrated. But I have not argued that she 
must believe herself to be calibrated at each fiture moment. On the con 
trary, such a requirement seems just as counterintuitive as requiring that 
she believe herself to have been calibrated at each past moment.” I have 
emphasized that an agent who at one time believes certain opinions to 
be merited by certain evidence may later change her mind. Although 
BC applied to diachronic updatings entails reflection, I avoid requiring 
that an agent believe in the calibration of her future selves, since I fail to 
require an agent to conform to BC in updating her opmion from one 
moment to the next, as new evidence is received. Instead, I regard BC as 
governing the steps of her justificatory arguments. 

The philosophical challenges here lie not so much in giving the 
formal argument, at the close of the previous section, as in under- 
standing just what that argument shows. Hence, the bulk of this pa- 
per was devoted to arguing (i) that an agent must believe the 
intermediate conclusions in her justificatory arguments to be poten- 
tially perfectly calibrated, and (ii) that BC could avoid certain impor- 
tant difficulties if it was applied within justificatory arguments rather 
than diachronically. I hope to have suggested a new way that BC 
might account for various features of theory confirmation in science. 

MARK LANGE 
University of Washington 


a “The Well-Calibrated Bayeman” (with discussion), Journal of the American Statis- 
tical Association, LXXVII (1982): 605-138. 

* van Fraassen addresses objections to refiection in his “Belief and the Problem 
of Ulysses and the Sirens.” 
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PRESENTISM AND ONTOLOGICAL CO 


resentism is the doctrine that only the present is real. Since 

ordinary talk and thought are full of quantification over non- 

present objects, presentists are in a familiar predicament: in 
their unreflective moments, they apparently commit themselves to 
far more than their ontological scruples allow. 

A familiar response is to begin a project of paraphrase. Truths ap- 
pearing to quantify over problematic entities are shown, on analysis, 
not to involve quantification over those entities after all. But I think 
that we might be better off abandoning paraphrase altogether. I 
suggest a project of discovering underlying truths rather than para- 
phrases. I shall explore this strategy as applied to defending presen- 
tism, but my hope is that lovers of desert landscapes everywhere will 
herein find words of comfort.’ 


* I would like to thank John G. Bennett, David Braun, Rich Feldman, Tamar Szabé 
Gendler, John Hawthorne, Europa Malynic, and Dean Zimmerman for ther help with 
this paper. I would also like to thank Earl Conese; much of what I say here about onto- 
logical commitment has been influenced by hearing his thoughts on the topic. Finally, 
I would hke to acknowledge a special debt to Ned Markoman: this paper began as a 
commentary on his presentation to the Philosophy of Thme Society (Los Angeles, 1998). 

' In fact, the general project has more importance to me than the special case, 
since I do not aad en orse presentism In particular, I make no effort to defend 
presentism from the objections that it is inconsistent with (1) contemporary phys- 
cal geometry (Hilary Putnam claims that presentsm is incompatible with 
relatvity-“Time and Physical Geometry,” this JOURNAL, LXIV, 8 (April 27, 1967): 
240-47; and see also page 336 below); or (ii) the claim that truth is superverment on 

see David Lewis, review of D. M. Armstrong’s A Combtnatorial Theory of Poss+- 
bility, Australasian Journal of , LXX (1992): 218-19. 

Desert lovers should also consult Joseph Melia, “On What There’s Not,” Anal- 
sis, LV (1995): 228-29; and Stephen Yablo, “A Paradox of Existence,” ın Anthony 
Everett and Thomas Hofweber, eds., Empty Names, Picton, and the Puzzles of Non-Foas 
tence (Stanford: CSLI, forthcoming); many of their conclusions mesh well with the 
present approach. In particular, there 1s much in common between Yablo’s approach 
and my own, of which I became aware only after writing this paper; I cannot 
undertake a comparison ın the present paper. 
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I PRESENTISM 
A presentst thinks that everything is present; more generally, that, 
necessarily, it is always true that everything is (then) present.” 

Presentism is the temporal analogue of the modal doctrine of actual 
tsm, according to which everything is actual. The opposite view in the 
philosophy of modality is possibilism, according to which nonactual 
things exist; its temporal analogue is eternalism, according to which 
there are such things as merely past and merely future entities. 

Where possibilists and eternalists speak with quantification, actualists 
and presentists must make do with irreducible sentence operators. The 
operators are modal operators for the actualist~" NECESSARILY(@) ' 
and" POSSIBLY (@) '~and tense operators for the presentist WAS '(@) 
and" WILL’ (@), among others. Whereas an eternalist can say ‘There 
exists, located in the past, a dinosaur with a 50 foot long tail’, a presen- 
tist must say “WAS (there exists a dinosaur with a 50 foot long tail)’. 
The truth of this sentence is consistent with presentism because the ex- 
istential quantification occurs within the scope of the tense operator, 
and thus does not carry a commitment to the existence of a dinosaur, 
just as ‘POSSIBLY (there exists a unicorn)’ is taken by the actualist not 
to imply the existence of a nonactual unicorn. 

Given the presentist’s acceptance of the tenses, some have won- 
dered whether the dispute over presentism is merely verbal. The 
presentist says while the eternalist denies that everything is present. 
But is it clear that each means the same thing by ‘everything’? The 
eternalist might view the presentist’s quantifiers as being quantifiers 
restricted to present entities; likewise, it might be argued, when the 
eternalist says ‘There is a dinosaur’, this may be translated into the 
presentist’s language as “There is, was, or will be a dinosaur’. But the 
translation scheme implicit in this argument is in general not truth 
preserving. For example, the eternalist will allow that there exists a 
set containing a dinosaur and a computer; but the proposed presen- 
tist’s translation of this assertion is the disjunction: 

(There is a set containing a dinosaur and a computer) or WAS (there is 

a set containing a dinosaur and a computer) or WILL (there is a set 

containing a dinosaur and a computer). 


* On presentism, see Arthur N. Prior, “Changes in Events and Changes in 
Things” and “Quasi-propositions and Quasi-individuals,” each in his Papers on Time 
and Tense (New York: Oxford, 1968); John Bigelow, “Presentism and Propertes,” 
in James E. Tomberlin, ed., Paslosophscal Perspectroes, Votume X: Metaphysics (Cam- 
bridge: Blackwell, 1996), pp. 35-52; and Ned Markosian'’s forthcoming “A Defense 
of Presentism,” ın Aleksandar Jokic and Quentin Smith, ed., Tims, Tense, and Refer- 
ence (New York: Oxford, forthcoming). For more references, see Bigelow’s bibli 
ography and Markostan's note 3. 
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which is presumably false since at no time does there exist both a di- 
nosaur and a computer.’ 

Moreover, if the dispute between presentists and eternalists is 
merely verbal, then the same ought to be true for the dispute be- 
tween actualists and possibilists; but, surely, this is not the case.‘ The 
dispute between the presentist and eternalist, then, is genuine. Each 
uses the unrestricted quantifier ‘everything’ in the same way, as ap- 
plying to absolutely everything; one thinks this includes merely past 
and future objects, the other does not. 

Il PROBLEMS FOR PRESENTISM 
I turn now to certain problems facing a presentist, which arise from 
the fact that we often appear to quantify over merely past things: 


(D) Dinosaurs are animals that once walked the earth. 


The case of (D) is quite unproblematic; a presentist can provide a 
translation of (D) that eliminates the apparent quantification over 
dinosaurs: 

(D,) WAS (there are some animals that are dinosaurs and walk the earth). 


It might be plausible for a presentist to argue that (D>) is what we actu- 
ally mean when we utter (D), or at least that a charitable semantics 
would associate the proposition expressed by (D,) with (D). The appar- 
ent quantification in (D) over past dinosaurs could be attributed to the 
fact that ordinary speakers are often careless about the order of quanti- 
fiers and sentential operators, particularly when the difference only mat- 
ters if an esoteric metaphysical doctrine like presentism is true. A large 
class of sentences about the past and future can be handled in this way. 

But some apparently cannot be. (D) is a special case in a couple of 
ways. One is that it is purely qualitative; there are no proper names, 
demonstratives, or indexicals referring to particular entities. Another 
is that (D) talks about the past, as it were, one time at a time; there 
are no ascriptions of cross-time relations. Departing from either of 
these features of (D) raises problems for the presentist. 

Take, for example: 


(L) Lincoln was tall. 


The only way to translate (L) into a presentist truth would seem to 
be to replace ‘Lincoln’ with some description. Arthur N. Prior takes 
this route (op. at, pp. 11-14). (L) might thereby be paraphrased as: 


> Tam assuming that the presennst assumes that ıt 1s always the case that sets ex- 
ist only if all their members do. 

* Yet another reason to think the dispute is genuine is the apparent conflict be- 
tween presentism and certain empirical theones-see footnote 1. 
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(L) WAS (there is someone who is called ‘Lincoln’, who is honest... 
and who is tall). 


For a translation of (L) to be true, it must express a true proposition. 
But the proposition cannot be a singular proposition containing 
Lincoln as a constituent, since Lincoln does not exist. The only 
remaining possibility seems to be to say that the translation of (L) 
expresses a purely qualitative proposition, containing just qualitative 
properties and relations as constituents. (Lp) expresses such a 
proposition. But now the problem is that Saul Kripke and other antr 
descriptivists have argued powerfully that names do not abbreviate 
descriptions.” (Lp) is therefore not synonymous with (L). 

Cross-time relations present a quite different problem, which 
arises from a limitation on the sorts of fact that can be expressed in 
the tensed language used by the presentist. Heuristically, one may 
think of the sentence’ WAS '(@) as being true with respect to a time, 
t if and only if sentence @ is true with respect to some time before t. 
And, still heuristically, a sentence @ (that lacks tense operators) is 
true with respect to some time if and only if it is true when the quan- 
tifiers in @ range over objects that exist at that time, and atomic for- 
mulas are evaluated with respect to that time. Of course, the 
presentist cannot accept this explanation of the tense operator, as 
the explanation quantifies over nonpresent times and objects; for 
the presentist, the tenses are primitive." Nevertheless, the heuristic 
device gives the idea, much in the same way that the idea of possible 
worlds clarifies modal talk, even if in the final analysis possible 
worlds are analyzed in terms of modality rather than the other way 
around. 

The problem, then, is the following.’ 


* Kripke, Naming and Necessity (Cambridge: Harvard, 1980); Keith Donnellan, 
“Proper Names and Idenufying Descriptions,” in Donald Davidson and Gilbert 
Harman, eds., Semaniscs of Natural (Dordrecht: Reidel, 1972), pp. 356-79; 
and David Kaplan, “Demonstratives,” in Joseph Almog, John Perry, and Howard 
Wettstein, eds., Themes from Kaplan (New York: Oxford, 1989), pp. 481-564. 

* A presentist might construct surrogates for past times out of materials existng 
in the present-for example, from propositions; see, for example, Prior, p. 138. But 
the tenses will be used in constructing these surrogate times; hence the time surro- 
gates will not be available for use ın analprrng the tenses. The issues here are paral 
lel to those which arise for actualists who construct possible-vorld surrogates from 
actually exisung abstract entities-see Robert Mernhew Adams, “Theories of Actual 
ity,” Nos, vm (1974): 211-31; Alvin Plantinga, “Actualism and Possible Worlds,” 
Theona, XL (19°76): 189-60; Robert Stalnaker, “Possible Worlds,” Noûs, x (1976): 65- 
75; and Lewis, On the Pherakty of Worlds (New York. Blackwell, 1986), chapter three. 

’ The problem need not be stated with names; consider, for example, ‘Some 
current philosophers admire ancient Greek astronomers’. For a recent discussion 
of this problem, see Bigelow. 
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(A) David Lewis admires Frank Ramsey. 


(A) seems true, but what is the presentist’s translation? Even waiving 
objections to descriptivism about names, (A) is still problematic. The 
following attempted paraphrase fails to be true because, according 
to the presentist, the description “The inventor of the best-system 
analysis of lawhood’ is (currently) nonreferring: 


(Apl) The inventor of modal realism admires the inventor of the best- 
system analysis of lawhood. 


Adding a past-tense operator does not help: 


(Ap2) WAS (The inventor of modal realism admires the inventor of the 
best-system analysis of lawhood). 


This is false because Ramsey and Lewis never existed at the same 
time. (A,2) is true only if the component sentence “The inventor of 
modal realism admires the inventor of the best-system analysis of law- 
hood’ is true at some past time, and that component sentence Is true 
at a time only if both definite descriptions refer to objects then. 
Each of the following attempts is better: 


(Ap3) The person x that is the inventor of modal realism is such that WAS 
(x admires the inventor of the best-syatem analysis of lawhood). 


(Ap4) WAS (The person x that is the inventor of the best-system analysis 
of lawhood is such that NOW (the inventor of modal realism ad- 
mires x)). 


(‘NOW’ is another tense operator, analogous to the modal operator 
‘ACTUALLY’.) In neither case is there a problem with the descrip- 
tions referring, since the time of evaluation shifts, in each case, be- 
tween the occurrence of the description for Lewis and the 
description for Ramsey. But a problem remains: the atomic formu- 
las ascribing the admiration relation seem false (with respect to the 
times in question). Roughly, in (Ap3) it is asserted that at some past 
time at which Ramsey existed, Lewis admired him, even though 
Lewis did not exist then; and in (A,4) the assertion is that it is now 
true that Lewis and Ramsey stand in the admiration relation, despite 
Ramsey’s current nonexistence.‘ 


* Notice that presentism does not on its own rule out the truth of (A,3) and 
(Ap4); for that we need a stronger claim which might be called smous by 
analogy with Alvin Plantinga’s term serious actuahsm—“Replies to My Colleagues,” in 
James Tomberlin and Peter van Inwagen eds., Alm Plantaga (Dordrecht: Reidel, 
1985), pp. 316-23 and 345-49. Since presentists should accept the truth of “WAS 
(there is an x such that NOW (x does not exist))’, presentism does not on its own 
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OL. UNDERLYING TRUTHS 

My defense of presentism against the objections of the previous sec- 
tion will be subject to the following constraint: the presentist should 
not completely reject ordinary talk and thought. The presentist 
should salvage something from what we commonly say. Not just be- 
cause we say it, we should take everyday talk seriously because we typ- 
ically have decent evidence for what we say. A presentist who 
completely rejected masses of ordinary talk as just being confused 
would be a quite radical skeptic. 

But I do not assume that the presentist needs to demonstrate that 
ordinary talk is true. I follow the lead of Ned Markosian, who in a re- 
cent paper suggests biting the bullet and admitting that sentences 
naming merely past individuals or ascribing croas-time relations are 
not true (op. cti.). Markosian’s suggestion is that while such sen- 
tences are not true, we can explain their appeal by noting that there 
are related truths with which they are commonly confused. 

In light of Kripkean considerations, the qualitative sentence (Lp) 
is not synonymous with (L). (Lp) is semantically close enough to 
(L), however, that it could plausibly be said to be confuseable with 
(L). (L) seems true to us, Markosian would say, because we do not 
adequately distinguish it from (Lp), which ts true. Consider next the 
case of cross-time relations. Markosian’s example is: 


(G) There was a great grandfather of Ned. 
A truth with which (G) might be confused is: 


(Ga) WAS {there is an x that is a father of Ned & WAS [there is a y that 
is a father of x & WAS (there is a z that is a father of y) ]}. 


rule out the truth of the parallel WAS (there exists an x such that x is a dinosaur, 
and NOW (x is a dinosaur))’; for that, we need serious presentam, which may be 
formulated as the conjunction of presenusm and the additional claim that 
“positive” atomic formulas, like ‘x is a dmoszaur’ and ‘x admires y’, can never be 
true with respect to times at which the referents of the names and variables con 
tained do not exist (more carefully: for every positive atomic formula œ with van- 
ables x,...x, free, the following is true: [ALWAYS, for all x,,... ALWAYS, for all x,, 
ALWAYS: if @ then 2y...x, all (currently) extst!). I shall assume that the presentsm 
to be defended is serious presentism. 

I shall also assume the unacceptability of two other views which, if true, could 
help with some of the difficulties in the text: (1) uung temporal analogues of “actu 
ally” operators that can be indexed to occurrences of tense operators and occur- 
rences of variables and names within atomic formulas—see Graham Forbes, The 
Metaphysics of Modahty (New York: Oxford, 1985), pp 90-98; (ii) accepting the cur- 
rent existence of uninstantiated individual essences of merely past individuals; for 
the modal analogue of this view, see Plantinga, “Actualism and Possible Worlds,” 
section 5; for critidam, see Robert M. Adams, “Actualism and Thisness,” 

XLIX (1981): 3-41; and Kit Fine, “Plantinga on the Reduction of Possibilist Dis- 
course,” in van Inwagen and Tomberlin, pp. 145-86 
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The strategy here is to find sequences of temporally overlapping ob- 
jects underlying ascriptions of cross-time relations. And for cases 
like (A), Markosian’s suggestion would be the following: 


(Au) There are various properties p-p, such that (i) Lewis associates 
pr p, with the name ‘Ramsey’, (ii) WAS (there is an object that 
has p-p. and 18 called ‘Ramsey’), and (iii) if there were an object 
that had p,- p,, Lewis would admire it. 


I believe that the general idea of giving up on paraphrasing prob- 
lematic sentences is a fruitful one. The core claim here is that, even 
if we deny that the problematic sentences are true, we can still ac- 
cord them some positive status, and thus not saddle ourselves with 
an implausible skepticism. But I do not follow Markosian in idenufy 
ing this positive status with confuseabtlity with a truth. I shall seek un 
dertying truths for claims like (G), (A), and (L), but I shall not claim 
that speakers confuse those underlying truths with the propositions 
they assert. 

The problem with confuseability as the status for the underliers is 
that the underliers are going to need to get pretty elaborate, to the 
point of its being implausible that we confuse them with anything. 
(Gu) needs to be complicated, for example, since Ned’s great grand- 
father might have died and been cremated before having children, 
Ned’s grandfather being created using the great-grandfather’s 
frozen sperm cells. This sort of possibility would only be accounted 
for by a complicated tensed sentence mentioning sperm and egg 
cells, which no one would confuse with (G). In the case of (A,,), the 
problem is that an implausible descriptivism about admiring is pre- 
supposed. The properties Lewis associates with ‘Ramsey’ may not 
have been had by Ramsey; or, there might be someone else who also 
had the same properties that Lewis attributes to Ramsey, and who 
was named ‘Ramsey’, who Lewis does not admire. What attaches 
Lewis's admiration to Ramsey himself is at least in part, I think, some 
kind of causal connection between Lewis and Ramsey, and not a 
matter of properties that Lewis attributes to Ramsey. But filling in 
the details properly here will, I suspect, require a quite complicated 
underlier, unlikely to be confused with (A). My suspicion is that 
these sorts of complications will quite generally need to be made in 
Markosian’s underlying truths. 

In place of seeking underliers that are confuseable with the origi 
nals, I suggest a more modest goal of seeking truths that suffice for 
the claim in question to be, if not true, then at least quasi true, as I 
shall say. I shall introduce the notion of quasi truth informally at 
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first, and then give a more precise characterization. The working 
idea of a quasitrue sentence is one that, philosophical niceties aside, is 
true. Put a second way, a sentence is quasi true if the world is similar 
enough to the way it would have to be for the sentence to be gen- 
uinely true. A third characterization specifies quasi truth by the role 
I want it to play in my defense of presentism. To remain plausible, 
presentism should not require us to alter drastically our beliefs about 
the past; giving up on our ordinary beliefs being true, but retaining 
belief in their quasi truth, is intended to be sufficiently nondrastic al- 
teration. 

Let us look at an example in some detail. Ordinary folks say 
‘Abraham Lincoln was tall’. The presentist’s reconstruction of the 
past renders this sentence almost, but not quite true. There are no 
true singular propositions about Lincoln since such propositions do 
not exist; what is true instead is a network of tensed propositions. In 
this network are included various descriptive facts, such as the fact 
that there was someone named ‘Abraham Lincoln’, who was presi 
dent of the United States, signed the Emancipation Proclamation, 
and so on, and who was tall. Antidescriptivist reasons for thinking 
that such facts are not sufficient for the truth of the sentence in 
question are familiar. But the presentist can get much closer. For 
consider other facts that the antidescriptivist thinks are relevant to 
the truth of the sentence, such as facts about the causal chain con- 
necting Lincoln to current uses of ‘Lincoln’. Such facts have their 
presentist analogues: the network will include a variety of tensed 
facts specifying an initial baptism of ‘Lincoln’, subsequent utterances 
of that name, and so on. 

In the network, in fact, can be included everything one could say 
in the presentist’s tensed language about the relevant bit of the 
world ai a subatomic level. Thus, the presentist can provide a sort of 
supervenience base for the sentence ‘Abraham Lincoln was tall’. 
Not in the usual sense, for according to usual understandings of su- 
pervenience, the existence of a supervenience base for a sentence 
renders that sentence true.’ But the presentist can provide what 
would be a supervenience base if presentism were false—or better, if 
eternalism were true. If there is such a “quastsupervenience base” 
for a sentence S—to a first approximation, a true proposition P that 
would have been true and entailed the truth of S, if eternalism were 


* On the most common supervenience terminology, supervenience applies to 
sets of properties (or predicates) not sentences (or propositions). See, for exam- 
pie, Jaegwon Kim, “Supervenience and jent Causation,” Southern Journal of 
Philosophy, xxu (The Spindel Conference Supplement, 1984): 45-56; and “Concepts 
of Supervenience,” Philosophy and Phenomenological Research, XLV (1984): 158-76. 
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true”-then I shall call that proposition P an underlying truth for S, 
and shall call S quasi true. And finally, I say that the presentist suff 
ciently discharges her obligation to “common sense” if she can show 
that ordinary utterances about the past are quasi true. 

This characterization of quasi truth, I believe, fits my initial gloss 
of quasi truth as “truth, philosophical niceties aside,” and “similar 
enough to the truth.” Moreover, I also think the notion of quasi 
truth can play the role I have prepared for it. I said earlier that a 
presentist who completely rejected ordinary talk about the past 
would be committed to an implausible skepticism, because we typi- 
cally have good evidence for our claims about the past. But if the 
presentist can maintain that ordinary talk about the past is quasi 
true, then this skepticism is avoided. Surely, ordinary empirical evi- 
dence does not favor the truth of ‘Lincoln was tall’ over tts quasi truth. 
The reason is that ordinary empirical evidence seems qualitative." 
Ordinary empirical inquiry will justify a belief that there was an ob- 
ject with certain qualitative features. One needs philosophical argu- 
ment, which goes beyond ordinary empirical justification, to support 
the further claim that a singular proposition about a particular past 
object exists and is true. Thus, my presentist attributes to sentences 
about the past a status that is supported by ordinary evidence. More- 
over, my presentist says that the past is quite similar to what would be 
required for the truth of our utterances about it. Objectionable 
skepticism is thereby avoided. 

Once we give up confuseability as the mark of an underlier, we 
need an alternate psychological explanation of why sentences like 
(G) appear true. But an explanation is readily found: in the onto- 
logically unscrupulous nature of natural language. In ordinary life 
we do not speak as if presentism is true: we quantify freely over 
merely past objects, and we are taught to talk as if such objects exist. 
Similarly, in ordinary life we quantify freely over nonactual objects, 
and over abstracta, without thinking very hard about whether such 
objects exist; we say that there are many ways to skin a cat, without 
worrying about what, exactly, a way is. This is psychological explana- 
tion enough. 


* Worries about the definition of quam truth, including the worry that condi- 
tionals of this sort are invariably vacuously true, are addressed in section Iv below 

n In saying that ordinary empirical evidence 1s qualitanve, I do not mean to 
deny that ordinary empirical inquiry typically results in justified belief in singular 
propositions. I grant that in typical cases, given good evidence, a justified belief re- 
sults in a proposition about a particular individual, if that propomtion exists It 
does not follow that the evidence justifies the conclusion that the proposition ex- 
ists; it could simply be the case that, yf the propontion exist, it is justified. 
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It is no accident that ordinary language quantifies so readily. It is 
convenient to quantify over things you do not really believe in, if 
there is a way to pass back and forth between quasi truths and real 
truths. Even if individual speakers cannot perform these conver- 
sions, it is easy to see how a group or people could evolve a practice 
of free-wheeling quantification: the talk would be useful, whether or 
not anyone is capable of actually doing the conversions. Suppose 
that, in fact, there are no such things as propositions, by which I 
mean that there are no entities that are capable of playing the role 
propositions are supposed to play. (This might be so, if a very strong 
form of nominalism held.) Sentences involving apparent quantifica- 
tion over propositions—for example, ‘Starbuck believes everything 
Boomer says’-would nevertheless be quasi true (provided we refine 
our definition of quasi truth to eliminate the assumption of proposi- 
tions—-see section IV below). The underlying truths might be facts 
about Starbuck’s brain, and his causal relations to Boomer’s utter- 
ances. Despite the fact that no one thinks about these underlying 
truths, and no one can “translate” sentences about Starbuck’s beliefs 
into sentences about these underliers, it will be useful to talk about 
Starbuck’s beliefs in this way in virtue of the truth of the underliers. 
Since the underlying truth would be a supervenience base for the 
sentence ‘Starbuck believes everything Boomer says’ if propositions 
existed, the quasi truth of this sentence will have the same implica- 
tions for other matters, such as Starbuck’s behavior, as would the 
truth of the sentence. The sentence will be just as useful for explain- 
ing behavior and other matters as it would be if it were genuinely 
true (more on this example below). 

The strategy, then, of seeking nonsynonymous underliers for 
truths about the past requires only that the underliers suffice for 
quasi truth, and not truth; and moreover, there is no requirement 
that ordinary people have thought of the underliers. I like this strat- 
egy for dealing with ontological commitment, and not just in the 
case of presentism. I discuss other applications of the strategy below; 
for the remainder of this section, I would like to explore just how far 
the method will take us in the defense of presentism. 

First, once we give up on synonymy, we need no general paraphrase 
technique that will give “non-ad-hoc paraphrases” for truths about 
the past. For the objection that a proposed underlier is too compli 
cated, or is ad hoc, seems to be based on the assumption that the un- 
derlier is intended to be synonymous with the original. 

Second, given the way I have “lowered the bar” and required of or- 
dinary utterances mere quasi supervenience on presentist facts, the 
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presentist can answer many challenges all at once. For example: in 
the case of cross-time relations, all internal relations are immediately 
rendered unproblematic. Internal relations are those which super- 
vene on the intrinsic properties of the relata. But intrinsic proper- 
ties of objects at times may be captured in the tensed language of the 
presentist; cross-time internal relations thus quasi supervene on facts 
acceptable to the presentist. Consider, for example, the assertion 
that there is someone who is exactly as tall as some ancient Greek 
philosopher. As-tal-as is an internal relation, the holding of which 
supervenes on the heights of the relata. The nonpresentst will agree 
that the truth of any cross-time ascription of this relation will be en- 
tailed by a true tensed proposition asserting the heights of the in- 
volved objects; in the present case such a proposition might be the 
proposition that ‘There is a person who ts five feet tall exacth, and it was 
the case over 2000 years ago that there exists a Greek philosopher who is ex- 
acth five feet tall’. Given presentism, this proposition does not entail 
the truth of “There is someone who is exactly as tall as some ancient 
Greek philosopher’; but it counts as an underlying truth for this sen- 
tence since the entailment holds if eternalism is true. Therefore, the 
sentence is quasi true. 

External relations-for example, spatiotemporal relations—-do not 
supervene on the intrinsic natures of their relata. A presentist, 
therefore, will need to find a quasi-supervenience basis for all cross- 
time external relations (and also cross-time relations that are neither 
internal nor external”). But it is a reasonable hypothesis that, as the 
nonpresentist would put it, all relations supervene (globally) on the 
totality of facts about (i) where and when intrinsic properties are in- 
stantiated, and (ii) nomological matters, including causal relations 
and laws of nature. Indeed, it is a reasonable hypothesis that all facts 
whatsoever supervene on this basis.” So if a presentist can find a 
quastsupervenience basis for nomological and spatiotemporal facts, 
then she will have found a quasisupervenience basis for all facts, and 
hence will have solved, in one fell swoop, all ontological problems 
for presentism of the sort considered in this paper: any utterance 
deemed true by the nonpresentist will turn out quasi true. I con 
sider classes (i) and (ii) of facts in turn. 


™ For this taxonomy of relations, see Lewis, On the Plurality of Worlds, p. 62. 

* Haecceitists will disagree here. At this point, I think the duh needs to 
take a stand and reject haecceitism, at least about nonpresent objects; for example, 
a presentist must deny that, if eternalism were true, ‘Lincoln was tall’ could dif- 
fer in truth value between possible worlds that are alike with respect to qualita- 
tive tensed facts and with respect to singular propositions about present objects. 
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Using tensed sentences of the form ‘It WAS/WILL BE the case n 
units of tme ago/hence that there is an object with intrinsic property 
F’, the presentist can provide a quasisupervenience basis for many 
claims regarding the instantiation of intrinsic properties at past and fu- 
ture times. What is less clear is that the full range of claims of spatio tem- 
poral property instantiation in science and everyday life can be given a 
quasi supervenience basis. As above mentioned (footnote 1), I am ig- 
noring the wellknown apparent conflict between presentism and spe- 
cial relativity. But even setting this aside, there is a question about how a 
presentist will ground claims that specify both that a property was (or 
will be) instantiated, and also, roughly, where that property was instanti- 
ated. The problem would be manageable if we were willing to accept a 
Newtonian conception of substantival space persisting through time, to- 
gether with its associated notion of absolute rest and position, for then 
tensed claims of the following form could serve as undertying truths: ‘Tt 
WAS/WILL BE the case n units of time ago/hence that there is an ob- 
ject at place p with intrinsic property F’. But without this assumption, it 
is at least prima facie difficult to see how a presentist could provide un- 
derlying truths for certam statements involving the comparison of posi- 
tions at different times—for example, the claim that a certain particle has 
been in a state of inertial motion throughout a certain period of time." 
I shall say no more about this matter here, save that it is a challenge 
that, to my knowledge, presentists have not yet adequately faced. 

If the spatiotemporal pattern of instantiation of intrinsic proper- 
ties can be given a quasisupervenience basis, the defense of presen- 
tism then reduces to the problem of showing that causation and laws 
of nature quasi supervene on the totality of tensed facts that the pre- 
sentist accepts. First note that, if Humean supervenience” is true, 
this problem can indeed be solved. According to Humean superve- 
nience (as a nonpresentist would put it, anyway), the totality of facts 
about the instantiation of local qualities throughout space-time set- 
tles all other facts, including nomological facts. If the difficulty of 


” Note that we do not need to make sense of absolute rest in order to make 
sense of inernal motion-in both Minkowski and neo-Newtonian space-times, the 
latter but not the former is well defined. 

D See Lewis, Philosophical Papers, Volume IT (New York: Oxford, 1986), pp. ix-xvii, 
and “Humean rvenience Debugged,” Mind, cm (199-4): 473-90. 

© Theories of ood that are consistent with Humean supervenience inchide 
Lewis's version of Ramscy’s best-system theory of laws (the most recent exposition 
is in “Humean Supervenience Debugged”), and the traditional regularity theory. 
For a critical discussion of the regulanty theory, see Armstrong, What Is a Law of 
Nature? (New York: Cambridge, 1983), part 1. Theories of causation that 
causation in terms of laws will be consistent with Humean supervenience if one of 
these Humean accounts of lawhood is true. 
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the previous paragraph can be answered, every case that a nonpre- 
sentist would describe as the instantiation of a local quality at a 
spacetime point has a presentist analogue: a tensed fact about the 
instantiation of a local quality. Hence, if Humean supervenience is 
true, these tensed facts will form a quasisupervenience basis for facts 
about everything, and so for facts about laws. 

Suppose, on the other hand, that Humean supervenience is false. 
The leading non-Humean view of laws of nature—defended by Fred 
Dretske, D. M. Armstrong, and Michael Tooley’’s the view that laws 
of nature are relations among universals. On this view, we have a law 
that Fs are Gs if and only if the “necessitation” relation holds between 
the universals Fand G Laws would be unproblematic for presentists 
on this view if the relevant universals all currently exist, for then state- 
ments of lawhood could be straightforwardly true~quasi truth would 
not be needed. But on Armstrong’s version of the theory, universals 
do not exist unless they are instantiated; for the presentist, this for- 
mula becomes: a universal exists only if it is currently instantiated.” 
This means trouble if some universals involved in the laws used to be 
instantiated but are no longer. Imagine that a certain sort of sub- 
atomic particle comes into existence only under very extreme condi 
tions, and imagine that these conditions were only created once, in 
the past.” A presentist might try appealing to descriptive propositions 
about these merely past universals, but this works only if universals 
have essences that can be captured by such descriptive means. 

Turning next to causation: on many views, facts about causation super- 
vene on facts about laws plus the instantiation of qualitative properties 
and single-time relations. If this is true, then attributions of causation will 
be quasi true for the presentist provided that attributions of laws are 
(since instantiation of qualitative properties and single-time relations can 
be captured by the presentist’s tensed claims). But problems will arise if 
singulansm about causation is true; if, that is, causal relations hold inde- 
pendently of facts about laws. The problem is particularly acute if the 
causal relation can hold between temporally distant events.” What 


" Dretske, “Laws of Nature,” Philosophy of Science, XLV (1977). 248-68, Armstrong 
(ibid.), Tooley, Causahon: A Reahst Approach (New York: Oxford, 1987). 

* For Tooley, universals are transcendent and hence can exist uninstantiated; 
the problem discussed in the text would therefore not arise. See Tooley, pp. 72-75 
and section 3.2. 

» Compare Tooley, pp. 72-73, who discusses the parallel case of a universal that 
never is actually instantiated, but might have been if certain conditions had arisen. 

* If causally related entities were always connected by a chain of temporally 
overlapping causally related events, then the presentist might be able to utilize 
tensed claims 


claims describing these chams to provide a quastsupervenience basis for at- 
tributions of causation. 
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would be the underlying truths for this sort of causation at a temporal 
distance? 

One might provide the underlying truths by accepting a senience- 
operator account of causation. The fundamental locution on this view 
is not ‘event ¢, causes event ¢,’, but rather involves a two-place tense 
operator ‘BECAUSE @, it WILL be the case n units of time hence 
that yw’, where @ and y are filled in with sentences." Thus, if some- 
one’s current happiness is caused by someone’s eating dinner an 
hour ago, the underlier is the following: 


ONE HOUR AGO (BECAUSE someone is eating dinner, it WILL be 
the case in an hour that someone is happy). 


This seems to be the best the presentist can do here, though the 
approach places some limits on possibilities for causation. Imagine a 
world where objects pop out of existence, causing distinct objects to 
pop into existence an hour later, and suppose that balls A and B dis- 
appear, and an hour later, balls Cand D appear. Which of the two 
balls appearing were caused by which of the first two balls? It seems 
that there are two distinct possibilities; A could cause C while B 
causes D, or, on the other hand, A could cause D while B causes C. 
To account for these two possibilities, the presentist must come up 
with underlying facts that distinguish them; but there appears to be 
only a single underlying tensed fact: 


ONE HOUR AGO (BECAUSE a ball disappears, one hour hence, a 
ball WILL appear; and BECAUSE a ball disappears, one hour hence, 
a ball WILL appear). 


The presentist can distinguish these possibilities if there are qualita- 
tive differences between the balls in virtue of which the causal rela 
tions hold; if balls A and Care red and Band Dare blue, then in one 
world we can say that a red ball’s disappearing causes a red ball’s ap- 
pearing, whereas in the other world a red ball’s disappearing causes 
a blue ball’s appearing. But one would have thought that this sort of 


= For similar proposals about presentist causation (without the present reserva- 
tions), compare Adams, “Tune and Thisness,” in P. French, T. Uehling, and H. 
Wettstein, eds., Midwest Studies in Philosophy, Volume XI (Minncapols: Minnesota 
UP, 1986), pp. 315-29, here p. 321; Bigelow, pp. 39-43; and Dean W. Zammerman, 
“Chisholm and the Essences of Events,” in Lewis Edwin Hahn, ed., Ths Philosophy of 
Rodenck M. Chisholm (Chicago: Open Court, 1997), pp. 90-92. 

= We might try to make the caused state of affairs in the underlying fact de re: 

There currently exist two balls, Cand D, and ONE HOUR AGO (there exists an 

xand a y, and BECAUSE x disappears, one hour hence, C WILL appear; and BE- 
CAUSE y disappears, one hour hence, D WILL appear). 
But the truth of this would require the falsity of serious presentism, a move we are 
currently trying to avoid; see footnote 8. 
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scenario could have occurred with duplicate balls, or, more exot 
cally, with balls that, by virtue of symmetry in their worlds, have ex- 
actly the same qualitative features, both relational and intrinsic. 
Moreover, one would have thought that causal differences of this 
sort could obtain without obtaining in virtue of the qualitative 
differences of the involved objects. These are possibilities that can 
only be admitted by a nonpresentust. 

This discussion of spatiotemporal property instantiation, laws, and 
causation shows that the present method for defending presentism is 
not a blank check. My claim has been that presentism remains plau- 
sible if, or to the extent that, ordinary statements about the past can 
be shown to be quasi true. Establishing quasi truth in effect amounts 
to showing that intuitively distinct possibilities can be distinguished 
on the presentist’s terms, that is, using the presentist’s tensed lan- 
guage and assuming the nonexistence of nonpresent things. And it 
ig an Open question to what extent this can be done. Indeed, given 
certain assumptions about laws, causation, and related matters, the 
presentist cannot show that all ordinary beliefs about causation are 
quasi true. The price to pay for presentism, therefore, is rejecting 
these assumptions; the price for the assumptions is rejecting presen- 
tsm. 

IV MORE ON THE NOTION OF QUASI TRUTH 
I said in the previous section that a sentence is quasi true if there is 
some true proposition that, if eternalism were true, would be true, 
and would entail the truth of that sentence.” There are several ques- 
tions to be asked about how this definition is to be understood. 
First, since eternalism seems to be the sort of proposition that is 
metaphysically necessarily false if actually false, the conditional be- 
ginning with ‘if eternalism were true’ cannot be understood in such 
a way that it is vacuously true just in virtue of having a metaphysically 
impossible antecedent. I do not regard this as a great obstacle to un- 
derstanding the conditional. Even if the denial of presentism is 
metaphysically impossible, in some broader sense it is not impossi- 
ble; it is not impossible in the way that ‘It is raining and also it is not 
raining’, and ‘Some bachelors are married’ are impossible. For lack 
of a better term, I shall call this broader sense of possibility logical 
possibilty. I think it is plain that we do make nonvacuous sense of 
counterfactual conditionals with metaphysically impossible but logi- 


™ It 1s important that the definition require that, if eternalism were true, the un- 
derlier would be tus, as well as that ıt would entail S; otherwise, every sentence, S$, 
would tum out quam true’ the underler would be the proposition that ether eternal 
ism 1s false or S ıs true. Thanks to John Hawthorne for a helpful discusmon of this 
matter. 
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cally possible antecedents, and I propose to make sense of one my 
self in giving my definition.™ 

Second, we must ask about the strength of ‘entails’. The under 
liers I have in mind do not logically entail the target sentences, since, 
the presentist supposes, presentism is actually true but the target sen- 
tence is not. The right strength seems to be that of metaphysical en- 
tailment, by which I mean that in no world that would have been 
metaphysically possible (if eternalism were true) is the underlier 
true and the sentence false. Thus, the definition reads: S is quasi 
true if there is some true proposition that would have been true and 
metaphyswcally entailed S’s truth, if eternalism had been true. 

There is finally the question of the ontology and ideology re- 
quired by my definition of quasi truth. The commitment to modal 
talk via supervenience is at the heart af my proposal, and presumably 
ineliminable, though it should be noted that this need not require a 
commitment to unactualized possibilia; one could take modal no- 
tions as primitive, or reduce modality in some way that does not re- 
quire possibilia. A more worrisome feature of the definition is its 
assumption that underlying truths are propositions. This assump- 
tion might be unwelcome, most notably if the method of quasi truth 
is to be used to eliminate commitment to propositions themselves. 
One remedy would be to utilize sentences rather than propositions. 
This could succeed even if sentences are unsuited to generally replace 
propositions, for many of the familiar limitations of sentences do not 
affect my use of underliers (for example, the underliers do not need 
to be objects of belief or semantic values of “that clauses”). Buta 
worry persists: we do not want underlying truths to be limited to 


* On this topic, see Jeffrey Goodman, “Extended Ersatz Realism” (unpub- 
lished); Willam Lycan, Modality and Mean:ng (Boston: Kluwer, 1994), pp. 38-59; 


175-204. The counterfactual conditional may not be required; the cpaditional 
could perhaps be one of entailment where the modality is more strict than meta- 
physical necessity, but not so strict as purely syntactic or modelheoretic entail 
ment 

A skeptic might allow some counterpossible conditionals but balk at the rich array 
of counterpossibles I need—conditonals expressing complex supervenience rela- 
tions that would hold if certain metaphysically impossible theses were true. While I 
cannot give a theory of the truth conditions of the counterpossibles I need, I think 
it can be seen that the worry is unfounded. Whatever one thinks these truth condi 
tions are, exactly-perhaps conditions referring to conventions, or causal or logical 
factathe truth makers required for the modal statements I need are presumably 
available whether or not presentism ts true. Thus, the present case is quite differ- 
ent from wellknown problematic uses of counterfactuals where the truth makers 
appear to be missing—counterfactuals that “float” on nothing (see, for example, 
Armstrong, p. 31). 
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those expressible in human languages. There are various fixes, none 
perfectly satisfactory; here are two. (1) Let underliers be sentences 
in a “Lagadonian”™ language, in which sparse universals, whether or 
not humans know of them, are used as predicates denoting (express- 
ing) themselves.” Cost: commitment to sparse universals. (2) Since 
we have no need for false underliers, let the underlying truths be 


facts, rather than propositions. Cost: commitment to facts. 
V TRUTH AFTER ALL? 


I have said that a presentist can reconcile a restrictive ontology with 
freely quantifying natural language and belief by retreating to quasi 
truth. But might quasi truth actually suffice for truth? 

“Objects are in contact only if there is absolutely no space between 
them.” That is what we would have said before the conception of 
matter given to us by classical physics; indeed, we would have re- 
garded this as being definitional of contact. We know now that 
nothing satisfies the definition; must we conclude that earlier folk 
never truly ascribed the predicate ‘contact’? A common response is 
that since the world was near enough to the folk’s definition of ‘con- 
tact’, their ascriptions of contact were true. Despite the fact that the 
folk would have vehemently adhered to their original “definition,” 
their word ‘contact’ expressed a relation which held in paradigm 
cases of contact, a relation which perhaps involves lack of visible sep- 
aration, resistance to further smashing together, and so on. Lewis 
describes this view about content as follows: 


It’s an old story. Maybe nothing could perfectly deserve the name “sen- 
sation” unless it were infallibly introspective; or the name “simultaneity” 
unless it were a frame-independent equivalence relation; or the name 
“value” unless it couldn’t possibly fail to attract anyone who was well ac- 
quainted with it. If so, then there are no perfect deservers of these 
names to be had. But it would be silly to lose our Moorings and deny 
that there existed any such things as sensations, simultaneity, and val- 
ues. In each case, an imperfect candidate may deserve the name quite 
well enough.” 


If a sentence is quasi true, then the world is fairly similar to the 
way it would need to be for the sentence to be true, since there is a 
true proposition that would be true and would suffice for the sen- 
tence’s truth, if eternalism were true. On the view of content in 
question, would this make the sentence in question true after all? 

The question may not have a definite answer. Surely, there is no 
sharp line dividing “imperfect candidates” from near misses. Never- 


= See Lewis, On the Phurahty of Worlds, section 3.2 
= “Humean Supervenience Debugged,” p. 489 
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theless, I think there is reason to doubt that quasi truths are truths. 
We should distinguish between candidates for being expressed by 
sentences and candidates for being expressed by subsentential ex- 
pressions. If there is an imperfect but good enough candidate for 
being referred to by the predicate ‘contact’, it is plausible to say that 
the sentence ‘Some things come into contact with others’ is true. 
This is less plausible when there is no candidate for the predicate, 
and only a candidate for the whole sentence; the reason is that there 
is some pressure, admittedly defeasible, to respect the structure of a 
sentence in assigning it content. The sentence is, syntactically, a 
quantified sentence saying that there are objects of a certain type. It 
scems right to say that this sentence is true only if there really are 
two objects that stand in the relation of contact. 

Call an interpretation of a language weakly devious if it respects the 
logical structure of the language (in other words, gives a conjunctive 
semantics for syntactic conjunctions, a quantificational semantics for 
syntactic quantificational sentences, and so on), but “reinterprets” 
(along the lines of ‘contact’) some subsentential expressions such as 
predicates, names, functors, and the like. And call an interpretation 
strongly devious if, and to the extent that, it does not respect the syntac- 
tic structure of the language in this way. I am suggesting that, other 
things being equal, strongly devious interpretations provide worse 
candidates for reference and meaning than do weakly devious inter- 
pretations. The truth of this principle (defeasibly) counts against the 
truth of the presentist’s quasi truths, since the underlying truths I 
have imagined do not structurally match the sentences in question. 
The sentence “There was a Greek philosopher who is exactly the 
same height as someone currently existing’ is, syntactically, the result 
of applying existential quantifiers to an atomic formula; but the pro- 
posed underlying truth is expressed by the conjunction of two tensed 
existentially quantified sentences: “There is someone who is exactly 
five feet tall’, and “WAS (there is someone who is a Greek philoso- 
pher, and who is exactly five feet tall)’. 

There is a further reason why this underlying truth should not 
count as a devious meaning for the sentence “There was a Greek 
philosopher who is exactly the same height as someone currently ex- 
isting’: it would only be plausible as a sufficient condition for the 
truth of the sentence in question. One could, of course, utilize in- 
stead a disjunction of all the underliers, which would itself count as 
an underlying truth for the sentence, and would be a more plausible 
candidate for being the devious meaning of the sentence. But there 
is a further obstacle: I think there is reason to prefer stmple strongly 
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devious interpretations over extremely complex ones. This princi 
ple, if granted, would provide more reason to claim that the presen- 
tist’s quasi truths are merei quasi true.” 
VL OTHER DESERT LANDSCAPES 

Presentists are not the only ones who face the dilemma with which 
we began. Nonpresent objects are not the only dubious things 
over which natural language and thought quantify. I shall now ex- 
plore the quasi-truth defense of some restrictive ontologies other 
than presentism.™ But first let me point out that the defense will 
not work for eliminating commitment to theoretical entities in 
science—electrons, for example. No one who believes in electrons 
thinks that facts about electrons supervene on other facts~observable 
facts, for example. Our evidence for claims about electrons admit- 
tedly comes from observation, but the familiar fact that theory is 
underdetermined by data shows that it would be posstble for the 
same observational data to be caused by many possible microcon- 
figurations. Observable facts, therefore, do not form a quasi- 
supervenience base for claims about electrons, and so if there are 
in fact no electrons, claims about them will not be quasi true. This 
is, | take it, a desirable feature of the quasi-truth defense, namely, 


” In a forthcoming paper, “The Ersatz Pluriverse,” I provide a strongly devious 
reinterpretation of sentences apparently quantifying over possibilia, but it is a sim- 
ple strongly devious reinterpretation, and it 1s not too strongly devious, and so, I 
think, 1s still plausible as preserving the truth of such talk. But if I am wrong about 
this, I would not mind accepting that talk about possibilia is merely quasi true 
Note that we would then need an account of quasi truth for noncontingent subject 
matter, see foomote 28. 

™ I do not pretend that the present approach provides a completely general 
haven from commitment to unwanted entities. One case I shall not discuss here is 
that of mathematical entitles Under the present modal definition of quasi truth, 
my approach is unsuitable for a defense of nominalism about pure mathematics, 
and in general for cases involving noncontingent truth. Any proposition trivially 
counts as an underlying truth for sentences of pure mathematics simply because of 
the fact that, if mathematical entities existed, then any true proposition would be 
metaphysically necessarily sufficient for all mathematical propositions. Defending 
desert landscapes is not that casy! The presence of such “underlying truths” would 
not make the world similar to how ıt would have to have been for the mathematical 
sentences to be true; and their presence would not make platonistic assertions 
“True, philosophical niceties aside’ As for applied mathematics, the quasitruth 
defense may well be appropriate, but it seems glib to appeal to quam truth and 
leave it at that, for it 1s not immediately obvious that there are indeed nomuinalistc 
underlying truths for the statements of applied mathematics. Indeed, one could 
regard Hartry Field’s Sctence without Numbers (Princeton: Univermty Press, 1980) as 
putatively establishing that appled mathematical claims are quasi true: the repre- 
acntation and uniqueness theorems that Field discusses in chapters 4 and 68 
would hold if Platonism were true, and thus in that case, the facts captured by the 
nominalistic axiomatizations Field discusses would be a quasisupervenience base 
for platonistic scientific claims. 
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that it brings out an asymmetry between scientific and philosophical 
ontological commitment.” 

A case other than presentism where the defense might be em- 
ployed was noted above: someone who disbelieved in propositions 
could hold that statements that appear to quantify over proposi- 
tions-for example, ‘Starbuck believes everything Boomer says’—are 
quasi true; the underlying truths would be facts about Starbuck and 
Boomer on which the truth of the sentence would supervene if 
propositions existed. As I have been arguing, it is not implausible to 
deny that this sort of sentence is true, provided one can show that it 
is at least quasi true. 

One sort of argument for the existence of propositions has there- 
fore been answered, an argument premised on the apparent truth of 
sentences that appear to quantify over propositions. Another sort of 
argument, based on considerations of logical form, remains, how- 


ever.” Argument Al seems plainly valid: 


Argument Al: 
Starbuck believes everything Boomer says. 


Boomer says that The Galacica is under attack, 
Therefore, Starbuck believes that The Galactica is under attack. 


But this could only be true if (i) the first premise is interpreted as 
quantifying over propositions, (ii) the second premise attributes a 
relation between Boomer and a proposition, and (iii) the conclusion 
attributes a (different) relation between Starbuck and that same 
proposition. But if these facts about the logical form of the con- 
tained sentences are correct, then, since sentences like the ones in 
the argument are often true, in particular ones like the second 


» These considerations also undermine what otherwise might have seemed a 
promising solution to the problem at the end of section Iv of eliminating the com- 
mitment to propomtions in the definition of quasi truth. The seemingly promising 
solution 1s an alternate definition of quasi truth: § is quam true if and only if, had 
eternalism been true, S would have been true. (This would bnng my proposal 
closer to a certain kind of fictionalism—for example, the fictionalism about poss 
bila defended in Gideon Rosen’s “Modal Ficuonalism,” Mind, xax (1990): $27- 
54.) The problem 1s that this definition cheapens quasi truth by not requiring the 
actual existence of underlying truths. Granted, the conditional ‘Had eternalism 
been true, S would have been true’, when true, will hold in virtue of the nature of 
the actual world; but in some cases a sentence might turn out quasi true despite in- 
sufficient grounding in actual fact. It would be for someone who disbe- 
lieved in subatomic particles, for example, to say that ere exist carbon atoms’ is 
quasi true, and yet, the conditional ‘If there had existed subatomic partcles, then 
there would have existed carbon atoms’ is presumably true. 

™ Compare George Bealer, “Universals,” this JOURNAL, Cx, 1 ( January 1998): 5- 
82, and Peter van Inwagen’s “Meta-ontology,” Erkenninis, XLVI (1998): 233-50. 


PRESENTISM AND ONTOLOGICAL COMMITMENT 345 


premise which express relations between persons and propositions, 
there must exist propositions. 

It is always open to a nominalist about propositions to try her 
hand at paraphrase; and perhaps she can paraphrase the sentences 
in the argument in a way that preserves its quantificational structure, 
but in which the entities quantified over are deemed less objection- 
able than propositions. But I shall proceed on the assumption that 
this way out will not succeed. In particular, I shall assume that no 
nominalistically acceptable paraphrase of the first premise can be 
obtained. 

Nominalists might divide about what to say about the second 
premise and conclusion, however. Unlike the first premise, they 
contain no overt quantification over propositions. A nominalist 
might regard these simple propositional attitude sentences as being 
true, and as not attributing relations between persons and proposi- 
tions. In this case, the argument would be regarded as invalid, and 
so unsound. On the other hand, the nominalist might insist that the 
second premise and conclusion concer propositions, and are there- 
fore false (or lacking in truth value). In this case, the argument 
would be regarded as valid but unsound. In each case, then, the ar- 
gument is unsound; but this generates a serious problem for the 
nominalist. We clearly enjoy inferential success, make useful predic- 
tions, and so on, using quantificational language; if we did not, we 
would have given up using such language long ago. How can this 
success be explained if arguments like Al are invariably unsound? 

Notice first that the existence of speakers’ intuitions that Al is 
valid does not provide a strong reason for thinking that it is indeed 
valid. The nominalist would be perfectly reasonable in claiming that 
the argument only seems valid because it shares surface form with ar- 
guments that are valid—for example: 


Joe loves every friend of Frank. 
hét ia fiendof Tail 
Therefore, Joe loves Chet. 


If analytic philosophy has succeeded in anything, it has been in 
showing the dangers of being misled by the appearances of language 
(think of “the round square’, ‘the average family’, and so on). 

What the nominalist must do 1s explain how we could enjoy inferen- 
tial success in employing arguments like A1, if such arguments are in- 
variably unsound. The answer lies in the fact that, if a sentence is 
quasi true, this typically will have many of the same entailments as 
would its truth. The first premise of the argument, that Starbuck be- 
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lieves everything that Boomer says, is quasi true. It therefore has a 
true underlier, which concerns facts about Starbuck vis-a-vis Boomer, 
and which would be true and suffice for the truth of the sentence, if 
propositions existed. There is likewise an underlier for the claim that 
Boomer says that The Galactica is under attack (nominalists divide, re- 
call, on whether or not this underlier actually suffices for the second 
premise’s truth). Now, surely, these underlying truths stand in logical 
relations that match the logical relations that hold between the 
premises and conclusion in the argument, or would have held if 
propositions had existed." That is, the truths underlying the premises 
surely imply a truth underlying the conclusion-if the world at the m+ 
croscopic level is sufficient, but for the nonexistence of propositions, 
for Starbuck believing everything that Boomer says, and for Boomer 
saying that The Galactica is under attack, then the world will be suff 
cient at a microscopic level (but for the nonexistence of propositions) 
for Starbuck believing that The Galactica is under attack. Thus, the ar- 
gument has the feature of quasi validity. if the premises are quasi true 
or true, then so will be the conclusion. Since for ordinary purposes 
quasi truth is as good as truth, it follows that quasi validity is as good 
for our inferential purposes as validity, and, moreover, that quasi 
soundness (quasi validity plus true or quasi-true premises) is just as 
good for our inferential purposes as is soundness. Thus, even if argu- 
ments of the kind considered here are invariably unsound, they are in 
many cases quasi sound, and this sufficiently explains the success of 
our inferential practices. (Notice that I am not saying that ordinary 
speakers know any of this. The explanation of our inferential success 
did not depend on our appreciation of any of the above.) 

A similar account of other sorts of quantification over dubious en- 
tities can be given. We utter sentences which appear true and which 


quantify over ways: 
There are many ways in which J could win this chess match. 


and we utilize arguments which appear sound, and whose soundness 
appears to depend on quantification over ways: 


Jee has tried ie sme Way to outsmart Frank. 





Therefore, Joe has tried na a TAP ache: 


How might logical relations vary from world to world? Suppose the nominak 
ist says that in the actual world, simple proposinonal attitude sentences are true, 
and so do not concern propositions. Al is therefore invalid. But if there had ex- 
sted propositions, surely simple PE attitude sentences would have then 
concemed them; Al would then been valid. 
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Sentences that quantify over ways are quasi true despite being false, 
for if there were such things as ways, facts about ways of winning 
chess matches would supervene on facts about chess and chess play- 
ers; such facts are therefore, in this wayless world, undertying truths 
for sentences quantifying over ways. By according sentences quant 
fying over ways the status of quasi truth, we avoid being absurd skep- 
tics. And in virtue of these underlying truths, arguments of the kind 
displayed can be useful despite being unsound, for they are often 
quasi sound. 
VIL CONCLUSION 

W.V. Quine” said that the ontological commitments of a theory are 
the values of the bound variables in a first-order rendition of that 
theory. Ordinary language and thought quantifies over all manner 
of implausibilia. When first-order paraphrases of everyday locutions 
were available, Quine took that route, but where paraphrase was not 
forthcoming, Quine was only too ready simply to jettison ordinary 
thinking, and adopt new, scientific, theories and ways of talking. 

Since then, many have concluded that Quine’s willingness to 
“quine” vast realms of ordinary discourse was overzealous, and I 
agree. But this has led too many contemporary philosophers to pos- 
tulate abstracta to preserve ordinary talk. We can yearn with Quine 
for desert landscapes, yet avoid the Quinean mangling of ordinary 
talk and thought, if we reject his conception of ontological commit- 
ment. Ordinary talk and thought, while not our best theory, is a 
good theory, and should be respected. It is respected well enough if 
we regard its sentences as being quasi true. 

THEODORE SIDER 

Syracuse University 


™ See, for example, “On What There Is,” in his From a Logical Point of View (New 
York: Harper and Row, 1953), pp. 1-19. 
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REDUCING THERMODYNAMICS TO STATISTICAL 
MECHANICS: THE CASE OF ENTROPY* 


t has become increasingly clear in recent years that the founda 
[= of statstical mechanics (SM) are almost as puzzling as—and in 

some senses are more puzzling than—the foundations of quantum 
mechanics. Lawrence Sklar’s’ important review of the field presents 
many different approaches to the foundational problems in SM. 
Here, the uninitiated encounter a bewildering variety of schemes 
aimed at explaining why SM works. What strikes one about these 
projects is that perhaps all of them crucially rely on surprisingly du- 
bious assumptions. Many projects invoke indispensable “rerandom- 
ization” posits, some with Hamiltonian dynamics “turning on” and 
“turning off.” Others work only in the limit of infinite particle num- 
ber and/or time. Still others rely on special dynamical properties 
that real systems are very unlikely to possess. The arguments sup- 
porting these assumptions are typically murky. Perhaps worse, they 
leave unclear their relevance to the project of providing a mechani 
cal explanation of why classical or quantum-mechanical systems be- 
have thermodynamically. Philosophically minded observers may feel 
at a loss when they cannot trace the reasoning from a foundational 
project to an explanation of why (say) particular gases tend to fill 
their available volumes. I argue here that we are often correct to 
feel this way. Most of the approaches to the foundations of SM have 
severed their link with the original foundational project, the project 
of demonstrating how real mechanical systems can behave thermo- 
dynamically. 

My argument focuses on two different frameworks in SM: the 
Gibbsian and the Boltzmannian. I concentrate on their different de- 
finitions of entropy, since they underscore the more general concep- 
tual differences between the two approaches. I argue against the 
prevailing view that the Holy Grail of modern foundational research 


* Early versions of this paper were presented at Cambridge University’s £E Club, 
Oxford University’s Colloquium in Honour of Robert Weingard, and the Univer- 
sity of Hull’s Conference on the Mathematical and Conceptual Foundations of 
Physics. I am grateful to the audiences at all three talks for useful discussion. I 
also want to thank Guido i 1, Bob Batterman, Joel Lebowitz, Michael 
Redhead, T.M. Ridderbos, Janneke van Lith and especially Shelly Goldstein and 
Jos Uffink. Finally, I thank the LSE’s Centre for Philosophy of Natural and Social 
Sciences. 

! Physa and Chance: Philosophical Issues in the Foundatons of Statistical Mechanics 
(New York: Cambridge, 1993). 


0022-362X% /99/9607/ 848-73 © 1999 The Journal of Philosophy, Inc. 


REDUCING THERMO DYNAMICS TO STATISTICAL MECHANICS 349 


should be a Gibbsian entropy that strictly mirrors the behavior of the 
thermodynamic entropy. The search for the Grail has not only been 
in vain; it has also been misguided. Attempts to find a Gibbs entropy 
that avoids the so-called “paradox” of the conservation of the fine- 
grained entropy have proved unsuccessful. Worse, the very goal of 
these Gibbsian approaches is radically at odds with the potential re- 
duction of thermodynamics to SM. Inasmuch as one seeks a reduc- 
tion (in some sense) of thermodynamics to SM, one loses her way by 
pursuing the Grail. The Boltzmann entropy, neglected for much of 
this century, therefore deserves serious reconsideration in light of 
the failure of mainstream Gibbsian SM to tackle the above founda- 
tional project. The subsequent discussion more or less follows the 
order of this sequence of claims. 

Before beginning, let me happily concede that for the practice of 
science, Gibbsian SM is usually to be preferred. Since the values of 
all the entropy functions I discuss agree at equilibrium, my argu- 
ments are necessarily philosophical in nature. My topic is the much- 
ignored interpretation of SM. How we interpret SM affects nearly all 
of the foundational problems plaguing the subject: for example, un- 
derstanding reversibility, reduction, the role of probability. No less 
than the notorious problems in quantum mechanics, these problems 
cry out for philosophical investigation. Indeed, the demand for 
foundational investigation is especially pressing in SM; for it has long 
been recognized that, as E. T. Jaynes’ expresses it, it “is the enor- 
mous conceptual difficulty of this field which has retarded progress 
for so long” (bid, p. 392). Yet philosophers and theoreticians have 
rarely tackled the subject. As Jos Uffink remarks, this neglect is no 
better illustrated than by the fact that Paul and Tatiana Ehrenfest’s 
classic 1911 survey article still represents, in many respects, the state 
of the art.’ 

I. REDUCTION AND IRREVERSIBILITY 
Consider the canonical example of a gas confined by a partition to a 
section of a container. Remove the partition. In a short span of 
time, the gas will relax to equilibrium; that is, it will spread evenly 
throughout the box, possessing a uniform pressure and tempera- 
ture. What accounts for this regularity? According to thermody- 
namics, the gas’s approach to equilibrium occurs mainly due to the 


* “Gibbs versus Boltzmann Entropies,” Amman fournal of Physics, HOM (1965) 
391-98. 

* Ufiink, “Nought but Molecules in Motion,” Studses m History and 
Modern Physics, xxvii (1996): 378 The Ebrenfests’s classic article is translated and 
reprinted as The Conceptual Foundations of the Statestical Approach m Machantcs (New 
York: Dover, 1990) 


350 THE JOURNAL OF PHILOSOPHY 


Second Law. The Second Law states that an extensive state function, 
the total differential 6S = 6Q/T, defined only for equilibrium states, 
is such that AS 2 J6Q/T. Loosely put, for realistic systems, this im- 
plies that in the spontaneous evolution of a thermally closed system, 
the entropy can never decrease and that it attains its maximum value 
for states at equilibrium. We can therefore think of the Second Law 
as driving the gas to its new, higher entropy equilibrium state. Using 
this concept of entropy, thermodynamics is able to capture an extra 
ordinary amount of diverse phenomena under one simple law. Re- 
markably, whether they are gases filling their available volumes, two 
iron bars in contact coming to the same temperature, or milk mix- 
ing in your coffee, they all have an (indirectly) observable property 
in common: their entropy increases. 

Coffee, milk, and iron bars are not elementary individuals in our 
ontology, of course. Exactly what are is a matter of great contro- 
versy. But so long as one believes that quasi-periodic, time reversal in- 
variant (TRI) laws govern elementary matter, we can frame our 
problem in terms of classical mechanics without fear that quantum 
mechanics will side-step our problem.* (The Schrödinger equation 
is TRI and quasi periodic, so interpretations of quantum mechanics 
that do not modify or interrupt the Schrodinger evolution will face a 
problem isomorphic to the one discussed here. Other interpreta- 
tions may face similar problems, but one must deal with these on a 
case-by-case basis.) In classical mechanics, the ontology is particu- 
late, and the coordinates and momenta of all of the particles in the 
system determine the state of a system. Temporal evolution of states 
is given by Hamilton’s equations of motion, dq,/dt = 0H/dp, and 
dp,/dt = 3H/dq, I shall represent the microstates of N-particle sys- 
tems by a point X e T and a 2Nfdimensional phase space spanned 
by Nfmomenta and Nf configuration axes, where fis the number of 
degrees of freedom of each particle. As the system evolves through 
time, this representative point will trace out a trajectory through T. 

We thus have two descriptions of our gas: one mechanical and the 
other thermodynamical. We would like to know how they relate to 
one another. This kind of problem is quite familiar to philosophers. 


* Roughly put, TRI means (for deterministic systems) that, if we take any nomi 
cally possible sequence of states of a system, then the reverse sequence of “tempo- 
rally reflected” states is dynamically necessary. That the equations are quasi 
penodic, or recurrent, means the following: for systems confined to a finite volume 
and a finite energy and for almost all (with respect to the standard measure) mr 
crostates X corresponding to these systems, rf a system starts in some state X, then 
at some time in its subsequent evolution it will return either to Xor to an arbitrar- 
ily small neighborhood A, around X. 
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Philosophers of biology, psychology, and all of the special sciences 
are busy trying to demonstrate how the properties and concepts 
used by their science are (not) reducible in some sense to the prop- 
erties and concepts of “lower level” sciences. The problem is one of 
intertheoretic reduction. 

One might expect the present reduction would be comparatively 
easy to complete successfully. After all, few take the thermodynamic 
properties of gases to demand a new ontology. The thermodynamic 
properties of gases are, broadly speaking, material properties, and 
we simply need to determine the mechanical properties to which 
they correspond. The trouble we immediately face, however, is that 
the properties of fundamental physics are much different from the 
properties of thermodynamics. The classical mechanical equations 
of motion are TRI and quasi periodic, yet the Second Law of ther- 
modynamics is not. Focusing on reversibility, Hans Reichenbach" 
puts the problem the following way: 


The elementary processes of statistical thermodynamics, the motions 
and collisions of molecules, are supposed to be controlled by the laws of 
classical mechanics and are therefore reversible. The macroprocesses 
are irreversible, as we know. How can this irreversibility of macro- 
processes be reconciled with the reversibility of mıcroprocesses? It is 
this paradox which the physicist has to solve when he wishes to account 
for the direction of thermodynamical processes (thd, p. 109). 


Strictly speaking, this is perhaps not the best way to frame the prob- 
lem. The fallacy of composition threatens. In philosophy of mind, 
for example, few think that component physical systems of the brain 
ought to possess the same properties as the brain (for example, in- 
tentionality). Similarly, we should not expect aggregates of particles 
to have the same properties as individual particles. There is no para- 
dox here. In principle, values of momenta and positions exist that 
will reproduce a thermodynamic system’s motion, at least if one is 
willing to posit conspiratorial initial conditions. 

The worry is really about whether thermodynamics can emerge in 
some natural way from mechanics. Thermodynamic systems—like 
chairs, tables, and similar systems picked out by our common object 
language—are nothing more than complicated arrangements of phys- 
ical properties. Very few would disagree with this. Thermodynamics 
does not threaten physicalism. In this very weak sense, thermody- 
namics is already “ontologically reduced” to mechanics. Unlike 
chairs, however, the thermal features of objects play a role in an ex- 


+ The Direction of Time (Berkeley: California UP, 1956). 
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traordinarily powerful science: thermodynamics. We have no hope 
of defining ‘chair’ in the language of physics-we do not believe it 
has a simple nondisjunctive mechanical counterpart. The concept is 
just too coarse. But thermodynamics defines its concepts compara- 
tively precisely, and it is closer to the level of fundamental physics 
than is our common object language. It is such a powerful tool in 
organizing a vast range of phenomena, so striking and explanatorily 
powerful, that we suspect we can find a simple, nondisjunctive coun- 
terpart for it at the (statistical) mechanical level. We suspect the 
thermodynamic entropy is either identical (a posteriori) to this me- 
chanical correlate or at the very least supervenient upon it. Indeed, 
we hope SM has already found this correlate. In this stronger, more 
Nagelian sense of reduction, where the reduced theory is a “defini 
tional extension” of the reducing theory, or at least an approxima- 
tion to it, the jury is still out concerning the reduction of 
thermodynamics to SM. 

As Reichenbach recognizes, the problem of macroscopic irre- 
versibility is a warning shot for those seeking such a reduction. For 
as we know from a century’s discussion of Boltzmann’s notorious H- 
Theorem, the TRI and quasi periodicy of the dynamics place severe 
restrictions on any mechanical definition of entropy. Rather than 
rehash what should be a familiar debate, let me briefly state what I 
take to be their main point. Consider the following assumptions: 

(A) Entropy is a function $ of the dynamical variables X(t) of an indi 

vidual system. 

(B) SCX(t)) = S(X*(4), where ‘*’ indicates a temporal reflection. 

(C) The system is closed (T is bounded). 


If (A), (B), and (C) hold, then the TRI of Hamilton’s equations im- 
plies S cannot increase monotonically for all initial conditions; and if 
(A) and (C) hold, the quasi periodicy of the solutions to these equa- 
tions implies $ cannot increase monotonically for all tme. In short, 
if Sis a function of the dynamical variables of an individual system, 
then S cannot exhibit monotonic behavior. This captures the 
essence of Joseph Loschmidt’s, Ernest Zermelo’s, and Edward Cul- 
verwell’s objections to the H-Theorem (see Sklar, pp. 39ff.). As we 
shall see, it is symptomatic of the confusion in the field that the 
lessons drawn from this implication are so varied. 

Perhaps the most common response in the physics literature is to 
take this implication as effectively a reductio of assumption (A). (As- 
sumption (B) is never challenged; in section m, (C) will be chab 
lenged.) The thermodynamic entropy is understood as displaying 
monotonically increasing behavior, therefore, many reason, the SM 
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analogue of the entropy cannot be a function of the dynamical vart 
ables of an individual system. We should instead conceive of entropy in 
terms of some function of a collection of systems. Clearty, the fact that 
S(X(t)) cannot monotonically increase is compatible with some other 
function defined on an ensemble of systems monotonically increasing. 
The move to ensembles is sometimes advertised as a “way around” the 
reversibility and recurrence properties of the classical dynamics.‘ In- 
deed, Jerome Rothstein explicitly treats it as such in a paper aptly enti- 
tled “Loschmidt’s and Zermelo’s Paradoxes Do Not Exist.”" Noting 
that Loschmidt and Zermelo’s paradoxes apply to individual systems, 
Rothstein argues that sence entropy is a function rightly applied to en- 
sembles, the paradoxes do not threaten proofs of entropy increase. 
With similar reasoning, many circles have come to see denying (A) as 
the “answer” to the reversibility and recurrence paradoxes. The reader 
of Sklar’s survey will recall, for instance, the denial of (A) as a refrain 
repeated with the introduction of nearly every approach. 

Of course, this foundational problem is not the primary reason for 
denying (A). The primary reason is that Gibbs’s approach to SM de- 
nies (A), and the Gibbsian approach dominates the field (which is 
the reason why Rothstein feels he can use as a premise the claim that 
entropy is a function on ensembles). Before criticizing the move of 
avoiding the paradoxes by denying (A), we need to compare the 


Gibbsian and Boltzmannian approaches to SM. 
IL GIBBS AND BOLTZMANN ENTROPIES 


In his famous work of 1902, J. W. Gibbs* worries about the paradox 
of the specific heats, a problem plaguing the field at the time. He fa 
mously reacts to it by developing a very abstract approach to the sub- 
ject, one not so dependent upon the actual constitution of classical 
systems. When he arrives at his various definitions of entropy, he 


* More than denying A is necessary to avoid recurrence in quantum mechanics. 
If a quantum system has a finite number of degrees of freedom, then recurrence 
among individuals ts matched by recurrence of the ensemble as a whole. This is the 
lesson of a theorem by S. Ono, Mem. Fac. Kyushu Unrvernty, x1 (1949): 125; and 
(independently) Ian Percival, “Almost Periodicy and the Quantal H Theorem,” jour 
nal Of Mathematical Physics, n (1961). 235-39. The theorem shows that the expecta- 
tion value <F >, of any dynamical variable Fin an arbitrary Gibbs ensemble us an 
“almost periodic” function of t. This implies that the probability of a particular 
value of a macroscopic state must come arbitrarily close to its initial value at arbi- 
trarily large times. To avoid this, one usually goes to the so-called “bulk limit,” in 
which N —> while keeping N/V finite Doing this, however, only serves to aggra- 
vate the problems already saddling the denial of (A); see section I. 

’ Foundatrons of Plrystcs, tv (1974): 83-89. 

, Princtples in Statistical Mechanics (New Haven: Yale, 1902; reprint, 
New York: Dover, 1962). 
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tentatively offers them as “analogies” to the thermodynamic entropy 
rather than as mechanical reductions of it. 

On Gibbs’s approach, one considers a fictitious infinite ensemble 
consisting of all the microstates compatible with a given macrostate. 
The macroscopic parameters thus pick out a distribution of possible 
representative points, which we treat as a fluid in I. Upon this en- 
semble we specify a normalized density function p (p,q,t). The mean 
values of a phase function with respect to p describe the system’s 
thermodynamic properties, except for the entropy and temperature. 
For the entropy, Gibbs chose the expression: 


(1) Sx(p(X)) = -kJp(X) Hog p(X)] aT 


where the integral is over I and k is Boltzmann’s constant. This is 
the so-called “fine-grained” entropy. For isolated systems, we use the 
microcanonical probability distribution in (1) and this will match 
(up to an additive constant) the value for the thermodynamic en- 
tropy. (1) will also allow us to define a temperature. Remarkably, 
using these definitions Gibbs can recover the familiar thermody- 
namic relations for systems in equilibrium. 

Although Gibbs himself originally introduced ensembles as a kind 
of mathematical trick useful for calculating observed values of 
macroscopic parameters, influential physicists such as D. Ter Haar 
and R. C. Tolman’ dissented from this view. They claimed that the 
adoption of ensembles is not merely pragmatically desirable but is 
warranted by the inevitable imprecision of our measurements. To- 
day, it is fair to say that this is the orthodox opinion. Their use in 
SM is so ingrained that, as D. Lavis” expresses it, one needs “a con- 
siderable act of will...to avoid all references to ensembles” (iid, p. 
260). Oliver Penrose,” for instance, is able to begin a major review 
of the foundations of SM with the claim that the fundamental prob- 
lem facing the subject is justifying the choice of the correct ensem- 
ble. 

How much, if at all, the popularity of Gibbs’s methods owes to the 
denial of (A) as a response to the recurrence and reversibility para- 
doxes is a historical question I am not competent to judge. Surely, 
the calculational ease of the Gibbs approach is the primary reason 


* Ter Haar, “Foundations of Stanstical Mechanics,” Reviews of Modern i 
xxv (1955): 289; and Tolman, The ies of Stahstical Mechanscs (New York: Ox- 
ford, 1988; reprint, New York: Dover, 1979). 

* “The Role of Staustical Mechanics in Classical Physics,” British Journal for the 
Philosophy of Saence, xxvii (19°77): 255-79. 

© “Foundations of Statistical Mechanics,” Reports on the Progress of Physics, XLII 
(1979): 1937-2006. 
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why it dominates other approaches. But Gibbs’s worries about the 
paradox of the specific heats surely struck a chord with many like- 
minded scientists. Scientists often cited this paradox as a partial jus- 
tification of instrumentalism and forms of positivism.” It would be 
interesting to know whether these philosophical ideas played any 
role in the subsequent popularity of Gibbs’s theory. 

Let us now turn to Ludwig Boltzmann’s” approach, in particular, 
his later approach after the H-theorem. In contrast to the Gibbs en- 
tropy, we define the Boltzmann entropy S, for the actual microstate 
of an individual system. Consider some microstate X. X corresponds 
to a macrostate M(X), which, in turn, is compatible with many dif- 
ferent microstates. We wish to determine the relative volume in T 
corresponding to all the microstates giving rise to M. To accomplish 
this, we must partition I’ into compartments, such that all of the mi- 
crostates X in a compartment are macroscopically indistinguishable, 
that is, they share the same thermodynamic features. We do this 
however we can, but for the ideal gas we use six-dimensional [-space. 
Here, we can write the total energy of the system as a sum of one-par- 
ticle energy functions on u. The [-space description is important 
only for discovering which microstates are macroscopically indistin- 
guishable. Once we determine this, we are then able to associate 
with each Ma certain volume of I’, |T „l, which is the integral of the 
time invariant Liouville volume element over [y. Up to additive 
constants, S, is then given by: 


(2) SX) = klog [Tum l 


where |T u] is the volume in T associated with M. S, can be thought 
of as providing a relative measure of the amount of I corresponding 
to the M determined by X. 

The volume corresponding to a particular macrostate depends, of 
course, on the macrostate in question. By far the compartment with 


a See Henk de Regt, “Philosophy and the Kinetic Theory of Gases,” The Britash 
for the Philosophy of Saence, XLVI (1996): 31-62, and references therein. 

= See, for instance, Vorlesungen uber Gastheones, Volumes 1 and 2 (Leipng: Barth, 
1896-98)—translated by S. G. Brush as Lectures on Gas Theory (New York. Dover, 
1964); Theorstical Physics and Philosophical Problems: Selected Wriengs, B. McGuinness, 
ed. (Dordrecht: Reidel, 1974). For recent work on Boltzmann's theory, see J.L. 
Lebowitz, “Boltzmann’s Entropy and Time’s Arrow,” Physics Today (September 
1993): 32-38, and "Macroscopic Laws, Microscopic Dynamics, Time's Arrow and 
Boltzmann's Entropy,” Physica A, cxa (1998): 1-27, J. Bricmont, "Science of 
Chaos or Chaos in Science?” in The Fhght from Science and Reason: Annals of the New 
York Academy of Saence, DCCLXXV (1996): 131-75; S. Ito, M. Mizutani, and T. Niwa, 
“On the Time Evolution of the Boltzmann Entropy,” Journal of Mathematics of Kyoto 
Unrvernty, XXVI, 1 (1986): 1-11 
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the greatest volume in T is the one each of whose points corresponds 
to a Maxwellian distribution (in [-space). When Nis large enough 
for the system to be macroscopic (~10*) it occupies almost all of I. 
If we think of a gas in a box, it is easy to see why this is so. There are 
simply many more ways for a gas to be uniformly distributed in a 
box, with uniform temperature and pressure, than for a gas to oc- 
cupy a small corner of the box, with nonuniform temperature and 
pressure. Almost all of the available I’, therefore, corresponds to a 
compartment representing a macrostate in thermal equilibrium. If 
we adopt the “natural” Euclidean probability metric over I’, we may 
say that thermal equilibrium is the most probable state of an individ- 
ual system. At equilibrium, for macroscopic Nat a given energy and 
volume, S, is approximately equal to the thermodynamic entropy. 
In section V, I shall return to our discussion of S, in order to clarify 
how it works and how it deals with various difficulties.“ 

What is the relationship between the Boltzmann and Gibbs en- 
tropies? As the Ehrenfests and others have pointed out, they take 
the same values at equilibrium. If the generalized microcanonical 
ensemble associated with M is defined as |T y| if Xe I, and 0 oth- 
erwise, then the two are identical at equilibrium (Lebowitz). This is 
not surprising, of course, for both approaches essentially seek to calb 
culate the region of motion in I’ available to a system. 

Nevertheless, conceptually the two entropies are very different. Most 
significantly, $ is a function of the microstate of an individual system, 
in contrast to Spg, which is a function of a probability distribution on 
a fictional ensemble. This is the cause of many other deep differ- 
ences between the Gibbsian and Boltzmannian approaches. For in 
stance, the two approaches conceive of equilibrium quite differently 
and, as a consequence, they also conceive of the SM counterparts of 
the thermodynamic laws differently. For Gibbs, equilibrium is char- 
acterized by a particular probability distribution over I. At equilib- 


H The pspace procedure is not in general useful for nonideal gases. Unfortu» 
nately, due to the neglect of the Boltzmann approach, no work (to my ) 
has been done on providing a subetitute for nonideal gases. Nevertheless, since 
work began on this paper two different approaches have been suggested to me, one 
by Uffink and Janneke van Lith and another by Sheldon Goldstein. To object to 
the Boltzmann approach for this deficiency is therefore premature. It would be to 


mistake a lack Se ee ee 
culty. The di ty is cl ot a problem of principle with the definition of S}. 
There is a fact of the matter about which subsets of I are macroscopically indistin- 
guishable. Furthermore, one will focus on macroscopic functions that are invariant 
under permutanon of particles. Each subset of I will have a volume proportional 
to the number of permutations compatible with the macrostate, and when Nis large 
one or more of these subsets will become vastly larger than the rest. 
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rium Sp, will not fluctuate. By averaging over an ensemble, it has ab 
ready built into its conception of equilibrium all of the improbable 
members of the ensemble whose S, spontaneously decreases. Thus, 
in Gibbs approach, it is correct to say one has fluctuational phenom- 
ena at equilibrium, so the entropy at equilibrium remains constant as 
in thermodynamics. On Boltzmann’s approach, by contrast, one fluc- 
tuates away from equilibrium, that is, away from the most probable 
value. This is because we characterize equilibrium by a particular 
proper subset in I’, the one whose points all correspond to a nearly 
Maxwell Boltzmann velocity distribution. We know that Sẹ, must fluc- 
tuate away from equilibrium simply because it is a function of X. The 
quasi periodicy of the dynamics for X implies that (for almost every 
microstate) S cannot always remain in equilibrium. 

Thermodynamics states that once an isolated system achieves equt 
librium, it stays in equilibrium forever (until disturbed by external im- 
fluences). Gibbsian SM agrees. Gibbsians suppose that the 
probability distribution characterizing equilibrium is time indepen- 
dent. Boltzmannian SM disagrees, however, as it abandons the idea 
that equilibrium is stationary in time. The Boltzmann approach bal 
ances this affront to thermodynamics by retaining the idea that equt 
librium and entropy are properties of individual systems. The Gibbs 
approach pays for its strict agreement with the thermodynamic laws 
by relinquishing the idea that entropy and equilibrium are properties 
of individual systems. We are confronted with a difficult choice. As 
Sklar puts it, the choice is of “sticking with a concept most like the 
thermodynamic concept and exchanging the law of thermodynamics 
for a statistical regularity [Boltzmann] or of moving to an ensemble 
concept and keeping the unexceptionless law [Gibbs]” (op. cit., p. 
865). In the next two sections, I argue that there are some reasons 
for preferring the Boltzmannian single-system approach to the Gibbsian 


one when thinking about the reduction of thermodynamics to SM. 
Il GETTING THE GIBBS ENTROPY MOVING 


No sooner does one begin thinking about Gibbs fine-grained en- 
tropy than one notices the following consequence of Liouville’s 
equation. Liouville’s equation, dp/dt=0, a consequence of the 
Hamiltonian equations of motion, states that the swarm of represen- 
tative phase points representing an ensemble move like an incom 
pressible fluid. The Hamiltonian flow preserves the measure of any 
set of points if and only if their corresponding density function satis- 
fies Liouville’s equation. Liouville’s equation implies, however, that 
the fine-grained entropy is invariant in time: 


(3) dSro(p) /dt=-k] dp/dt{1 + log p} dT = 0 
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Sr, does not change, yet the thermodynamic entropy clearly does. 
This is the so-called “paradox” of the conservation of the fine- 
grained entropy.” 

Although this “paradox” is standardly taken to present a huge dif- 
ficulty for the theory, it is not immediately clear why this is so. The 
thermodynamic entropy is defined only at equilibrium. The entropy 
gradient holds only between equilibrium states. Explaining this fact 
does not require that we discover a function that is well defined and 
increasing during the interval between two equilibrium states. Find- 
ing a function defined only at equilibrium but whose value at a later 
equilibnum state is higher than at an earlier equilibrium state is 
enough. The Gibbs approach has no problem doing this. Since the 
macroscopic parameters change between two such states, we define a 
new Gibbsian ensemble for this state and this will match the thermo- 
dynamic entropy. The knee jerk reaction to the conservation of the 
fine-grained entropy therefore needs re-examination. While, in the 
words of J. Bricmont (op. ät), it may be “interesting or useful” to ex- 
tend the concept of entropy to nonequilibrium situations, it is not 
necessary in order to account for thermodynamic irreversibility. 

If there is a problem with Spg, it is instead that it is “not fær to choose 
a new ensemble description of the system at a later time” (Sklar, p. 54; 
my emphasis). The ensemble at the later time should be, it seems, the 
Hamiltonian time development of the earlier ensemble; otherwise, the 
statement that Hamilton’s equations govern the system is a fiction. Un- 
fortunately, as Gibbs found, no interesting function of the earlier en- 
semble approaches the value of that function for the later ensemble. 
The time development of the earlier ensemble will not lead it to ap- 
proach the later one in any way. This fact is a more serious worry about 
the fine-grained entropy, not that it does not monotonically increase in 
the intervals between equilibrium states. The problem, therefore, re- 
ally has much more to do with whether the fine-grained entropy can 
provide a plausible reduction basis for the thermodynamic entropy 
than it does with the simple fact that Spg does not move. 

In any case, my present target is not the fine-grained entropy itself, 
but rather the multitude of attempts to get Sye moving between equi- 
librium states. Beginning when the Ebrenfests first complained 
about Gibbs’s treatment of irreversibility, the last eighty-five years 
have witnessed the near universal acceptance of the idea that to ex- 
plain “irreversibility,” we must find some SM entropy function that 
behaves monotonically in time. These approaches either retain Spg, 


= A. Webri, “General Properties of Entropy,” Reosems of Modern Physics, L (1978): 
221-58, here p. 227. 
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and therefore modify the dynamics to escape Liouville’s theorem, or 
they define a new Gibbs-ike entropy and try to show that it behaves 
monotonically with time. The goal, which has become the Holy 
Grail of modern research in the foundations of SM, is to find an en- 
tropy function that strictly mirrors the thermodynamic entropy. In 
other words, the goal is to find some function that always increases 
until reaching a maximum value at which it remains forever. De- 
spite producing many interesting results, I believe these approaches 
are unsuccessful, unnecessary, and wrong-headed. Showing that this 
is the case is the task of the remainder of this section and the next. 

Before getting into the arguments, let us first ask why one would 
demand having a strict statistical mechanical version of the Second 
Law, one exhibiting non-TRI behavior. Why believe nomic irre- 
versibility is necessary for the task at hand? One often finds the claim 
that thermodynamic entropy increase is strictly irreversible, and 
therefore, that we must base any mechanical counterpart of entropy 
on something not TRI. This belief is based on a deep confusion 
about what it takes for one theory to correspond to another. All we 
need to do is explain how something apparently irreversible emerges 
from something reversible, and we need not accomplish this through 
non TRI laws. We merely need to reproduce the tmage of thermody- 
namics in SM at the appropriate scale, not the actual mathematical 
properties of the thermodynamic laws. When trying to recover one 
higher-level theory from a lower-level theory, we merely need to re- 
cover the phenomena supporting the theory of interest, not a literal 
statement of the reduced theory. TRI laws plus temporally asymmet- 
ric initial conditions can of course mimic the phenomena governed 
by thermodynamics, so non-TRI laws are not necessary.” 

Despite the great effort put into getting Sro moving, no successful 
approach has yet emerged, nor does any seem likely to emerge. Un- 


* Some-—tike N. Krylov, Works on the Foundations of Statistical Physics (Princeton: 
University Presa, 1979)—object that the rehance on initial conditions in the recov 
ery of the thermodynamic laws would threaten the lawfulness of thermodynamics. 
If we found the laws of thermodynamics to be dependent upon special mechanical 
iniual conditions, they would not be universally valid and would therefore lose 
their explanatory force (for example, their ability to support counterfactuals). 
This strikes me as an overreaction. If the laws of thermodynamics can only be re- 
covered mechanically by adoptung special initial conditions, then we shall merely 
have discovered that thermodynamics is a special science. Thermodynamics would 
be hke biology in its need of special mechanical inital conditions. But that does 
not imply that thermodynamics loses its modal force any more than the same utua- 
tion implies biological generalizations have no modal force. (Or if it does imply 
that, then the problem is already a huge one In either case, the point is that there 
is no special problem here for thermodynamics See my “What Is “The Problem of 
the Direction of Time’>” Philosophy of Saence (Supplement), LXM, 2 (1997): 223-34.) 
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derstanding irreversibility this particular Gibbsian way puts one in a 
hole out from which one cannot plausibly dig. Obviously, I cannot 
do justice to the rich variety of such attempts in one essay, never 
mind one section of an essay. Fortunately, most of the major objec- 
tions to the alternatives are already well documented. For this rea- 
sou, I shall quickly remind the reader of some of these difficulties, 
refer her to the relevant readings, and then make a more general ob- 
jection. I think it is primarily important that the reader get a flavor 
of the difficulty and cost of extracting oneself from the “problem” of 
the conservation of Spg. 

Coarse-graining. The usual response to the conservation of Spg is 
to devise new notions of entropy and equilibrium, in particular, the 
coarse-grained entropy and a notion of equilibrium suitable for it. 
The motivation for these new notions is solely as a means of escaping 
the above “paradox,” though it is usually defended with appeals to 
the imprecision with which we observe systems. 

In the coarse-grained approach, one divides I into many small ft 
nite cells of volume (K. One takes the average of p over these cells, 
obtaining the coarse-grained probability p, and attributes the value 
of p to all Xe Q, By substituting p for p in (1), one defines the 
coarse-grained entropy 5Sq,(p(p, Q, #). This entropy need not be 
constant in tme. Due to Liouville’s theorem, the volume p remains 
constant; however, the density of p in each cell Q, will change. Equi 
librium is now defined as the state in which p has fibrillated unt 
formly throughout the available T. One then hopes to demonstrate 
that S,, will monotonically approach equilibrium. 

One such attempt is Tolman’s generalized H-theorem. Tolman 
shows that, if one assumes that p) = Po where p, means p at time 
t=n, then it will follow that Sesel) S Scg{p,), and that Seco) 
SSo;(P,), and so on. The proof is notoriously weak, however. First, it 
is not a consequence that Sæl) S Sq,(f,). Nothing follows about 
the relative values of two Sos if both are later than the initial condi 
tion. Second, the assumption that p>) = Po is a very strong and con- 
tentious one. Third, and perhaps most worrying, the irreversible 
behavior of Sg, arises almost solely due to the coarse-graining. Al 
though coarse-graining will undoubtedly be part of our explanation 
of entropy increase, due to the large-scale differences between micro 
and macro, surely it is not the whole story. Thermodynamic behav- 
ior does not depend for its existence on the precision with which we 
measure systems. Even if we knew the positions and momenta of all 
the particles in the system, gases would still diffuse through their 
available volumes. 
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Others have tried to resurrect this style of justification by using argu- 
ments that assume a Markov condition. If the transition probabil+ 
ties for 9 do not depend on the actual past state of the system, then 
it is possible to show that this kind of rerandomization will spread p 
uniformly throughout I’. The question is whether this sort of reran- 
domization posit is compatible with the underlying deterministic dy- 
namics. To answer this, many physicists and mathematicians turn to 
the idea that the underlying dynamics is “mixing.” 

Mixing. In an attempt to make plausible the idea that a Markovian 
postulate can describe correctly the ensemble evolution, many re- 
searchers appeal to certain features of the dynamics. The property 
of mixing is particularly popular. Loosely put, an ensemble is mix- 
ing if, as p(t) spreads out in I’, its ‘threads’ are uniformly distributed 
in the limit when t->0. Advocates of this approach hope that real 
systems are mixing. If they are, this is supposed to warrant the use of 
a Markov postulate in the coarse-grained approach. 

Despite many beautiful results, the relevance of mixing to the ap- 
proach to equilibrium is doubtful. Apart from the problems of be- 
ing valid only in the infinite time limit, the mixing justification also 
suffers from the well-known “measure zero” problem. Perhaps 
worse, mixing is also neither necessary nor sufficient for thermody- 
namic behavior. Many systems exhibiting no “irreversible” behavior 
have the property of mixing (for example, a particle in a rigid one- 
dimensional box).” And due to the KAM theorem, we know many 
real thermodynamic systems are not mixing systems (Sklar, pp. 
169ff.). 

Master equations. Master equations are a kind of generalization of 
Boltzmann’s kinetic equation. They are linear Markovian rate equa- 
tions for the probability distribution of a system. The idea here is to 
show that the probability distribution for a system approaches the 
equilibrium distribution under certain conditions. These demon 
strations are successful only when one uses the infinite time limit, 
continually rerandomizes the system, and posits special random ini- 
tial ensembles. The justification for any of these conditions and the 
relevance of the result if successful is obscure.” 

Interventionism. No realistic thermodynamically closed system is me- 
chanically closed. Treating them as such is clearly an idealization. A 
real gas in a box is subject to all sorts of ignored, uncontrollable per- 
turbations: for example, the gravitational pull of the moons of Jupiter, 
quantum fluctuations of the walls of the container, cosmic rays, and so 


" SK. Ma, Statistical Mechanics (Singapore: World Scientific, 1985), p. 446. 
4 See Sklar for critical review. 
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on. Interventionists take these perturbations as essential to why em 
tropy increases. They hold that these uncontrollable perturbations 
warrant abandoning Hamiltonian dynamics and hence Liouville’s the- 
orem. This rejection of the classical dynamics allows them to write an 
equation for the fine-grained entropy which does not have the effect 
of leaving it an invariant of the motion. Modeling the “random” ex- 
ternal influences with a stochastic equation of motion, perhaps as P. 
Bergmann and J.L. Lebowitz” do, they are able to circumvent the con- 
servation of the fine-grained entropy. They therefore have no need to 
“brave the ambiguities of the coarse-grained entropy formulae,” as 
Penrose puts it (op. at, p. 1938). (In terms of our earlier problem, 
interventionists deny assumption (C), but most deny (A), too.) 

As an answer to the foundational questions in SM, however, I 
think interventionism is a nonstarter. If classical or quantum me- 
chanics is the fundamental theory of the world, then it governs the 
environment of closed systems, too. Ifso, then Hamilton’s and Liou- 
ville’s equations govern the environment, too, and thus the total 
fine-grained entropy of system + environment does not increase. 
One might then refer to the fact that the system + environment is it- 
self subject to uncontrollable external perturbations from its envi- 
ronment, and that with respect to these perturbations, the entropy 
of the combined system increases, and so on a la Reichenbach. But 
clearly this is a mug’s game. Unless one is willing to bite the bullet 
and claim that the TRI quasi-periodic laws of mechanics only govern 
the universe as a whole, and not its subsystems, then this method of 
pushing the explanation back further and further will always be in 
vain. But to do this is to claim that the laws of nature do not govern 
the very items that are evidence for believing the laws of nature, that 
is, the subsystems of the world. The fundamental problem with in- 
terventionism is that it wants to have it both ways. It does not want 
to offer new fundamental laws (for example, as Ilya Prigogine or 
David Albert” do) but it does not wish to be constrained by the cur- 
rent TRI deterministic laws either. 


” “New Approach to Nonequilibnum Processes,” Physical Renew, xcrx (1955): 
578-87, See T.M. Ridderbos and M. Redhead, “The Spm Echo Expenments and 
the Second Law of Thermodynamucs,” Foundations of Physics, xxvill (1998): 1237-70, 
and references therein for defenses of interventionism. 

™ Albert points out that according to the GRW interpretation of quantum me- 
chanics, the indeterministic and non-TRI Laws of nature may be helpful to an un- 
derstanding of irreversibility; see “On the Foundations of Quantum Mechanics and 


the Approach to Thermodynamic Equilibrium,” British Journal for the Philosophy of 


mechanics; see “A Unified Foundation of Dynamics and Thermodynamics,” Chew- 
ica Scripta, IV (1973): 5-32. 
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In addition, the approach founders on justifying the stochastic 
terms used in modeling the environment and on justifying the nec- 
essary time asymmetric effect of the environment. If the environ- 
ment does not operate in a random way, interventionism will not 
work. We would have no guarantee that systems will tend toward 
equilibrium. (If we do not want such a guarantee, or we have other 
reasons to think systems will tend toward equilibrium, then interven- 
fionism is unnecessary.) But we cannot justify these random influ- 
_ ences (as they often are) by appealing to our inability to control or 
predict the effects of the environment. Uncontrollable and unpre- 
dictable influences, we know well, are not the same as genuinely ran- 
dom influences. If deterministic laws govern the environment, they 
cannot be genuinely stochastic. A similar problem arises concerning 
time asymmetry. If the approach is to avoid implying that entropy 
increases toward the past, time asymmetric laws must govern the en- 
vironment if the approach is not to beg the question. Again, unless 
we adopt new, in this case non-IRI, laws of nature for the environ- 
ment, this assumption will not be warranted. Finally, we can ask 
whether it is really plausible or necessary to suppose that (say) the rea- 
son why gases expand through their available volumes is in part due 
to the locations and sizes of the moons of Jupiter. 

Radical approaches. Given the problems with all of these ap- 
proaches, it is not surprising that some physicists turn to quite radi 
cal approaches to the problem of irreversibility. One is due to 
Jaynes. This is the view that entropy increase is due to the informa- 
tional state of the observer and the nature of statistical inference. 
The other is due to the so-called “Brussels School” led by Prigogine 
and his theory of subdynamics. This approach alters the fundamen- 
tal notion of a state of a system, replacing trajectories by probability 
distributions. While one can sympathize with both approaches and 
agree that they are both ingenious in different ways, in my opinion 
neither view can be sustained.” 

Finally, though each approach has its own particular difficulties to 
overcome, they all (with the possible exception of the Brussels 
school) share a very general problem, what Sklar dubs the “parity of 
reasoning problem.” Either the reasoning behind the approaches 
also works for the past direction of time as well as the future, in 
which case the theory predicts entropy increase toward the past, or it 
smuggles in an unwarranted temporally asymmetric assumption, 


1 For criticism of the second view, see Robert Batterman, “Randomness and 
Probability in Dynamical Theories: On the Proposals of the Pngogine School,” Phr- 
losophy of Saence, ivm (1991) 241-68. For cnticism of both views, see Sklar. 
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analogous to Boltzmann’s notorious Stasszahlansatz. This assumption 
typically either conflicts with the Hamiltonian dynamics or merely 
ends up pushing the question back a step. Falling victim to the first 
horn of this dilemma implies the theory of interest is wrong half the 
time; falling victim to the second half implies the theory is either in- 
consistent or incomplete. For this reason, along with the multitude 
of less general difficulties, I consider it unlikely that any of these ap- 
proaches ever will yield a satisfying explanation of irreversibility. 
IV. REDUCING THE THERMODYNAMIC ENTROPY 

We have just seen how hard it is to find a Gibbsian entropy function 
that monotonically increases in time. But even if one manages to 
skirt the above objections and define an entropy with the proper be- 
havior, that alone does not make a function suitable as a mechanical 
counterpart of the thermodynamic entropy; for, as Jaynes reminds us 
when discussing Boltzmann’s H-function: “Merely to exhibit a math- 
ematical quantity which tends to increase is not relevant to the sec- 
ond law unless one demonstrates that this quantity is related to the 
entropy as measured experimentally” (op. cit, p. 392). With this sen- 
timent in mind, I wish to make a sweeping objection against any ap- 
proach that defines an entropy function that increases strictly 
monotonically with time. 

Monotonically increasing ensemble entropies are generally re- 
garded as the mechanical version of the thermodynamic entropy. 
But are they suitable for this role? Sklar notes well the difference be- 
tween entropy as a feature of individual systems and entropy as a fea- 
ture of probability distributions. He cautions us not to move too 
quickly in our reduction of the thermodynamic entropy to the en- 
semble entropy, remarking that this identification is fraught with dif- 
ficulty. He claims, rightly I believe, that to assert an “identity” 
between features of ensembles and thermodynamic features would 
be to misrepresent the relationship between the two theories (op. at, 
p. 361). But he never adjudicates the debate between the single- and 
many- system approaches as they relate to reduction. One suspects 
Sklar thinks of the Gibbsian reconception of entropy as a property of 
ensembles rather than a property of individual systems as simply an 
instance of concept extension or revision. The idea of concept ex- 
tension or revision is a familiar one to philosophers. When applying 
a concept to a new context, we may slightly reform its meaning. The 
reformed concept may not preserve the original term’s intension or 
extension, yet its meaning may not have changed sufficiently to war- 
rant calling it a new concept. What counts as sufficient? Clearly, 
there is no hard and fast rule telling us how to distinguish reformed 


ett y 
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concepts from new concepts; rather, we make intuitive judgments 
about whether the reformed concept changes from the orginal only 
in “peripheral” features. Undoubtedly, we should expect such an ex- 
tension in the present case. Whenever we explain the concepts of 
one theory in terms of those of another we inevitably use concepts 
with features different from the reduced theory’s concepts. When 
we move from thermodynamics to SM, for instance, we extend the 
concept of temperature to absolute negative values for spin systems. 
On a SM treatment of temperature, this seems to make sense (but see 
Uffink, op. cit). Since “being positive” arguably is not essential to 
the thermodynamic temperature (like its relationship to pressure is), 
this appears to be a case of concept extension and not concept sub- 
stitution. Yet is the switch from an entropy defined for individual 
systems to one defined for ensembles merely an instance of a surpris 
ing concept extension? 

It may be. Perhaps we can understand features of ensembles as 
properties of single systems. Gibbs himself seems to have thought it 
possible to eliminate reference to ensembles, although it is not en- 
tirely clear what he had in mind.” Nevertheless, denying that en 
tropy is a property of individual systems tn order to avoid recurrence and 
define a monotonically tncreasing entropy function does not so much elas- 
tically stretch the original concept as it does completely break from 
it We can see this in a variety of more or less striking ways. The 
simple argument that follows strongly suggests that no monotoni- 
cally increasing function of time should be identified as the reduc- 
tion basis of the thermodynamic entropy. 

Suppose one is able to define a function of an ensemble, G(p), 
that increases monotonically with time until reaching a maximum 
value, at which it remains forever. Gis defined for an ensemble, for 
we know from above that if it were not (that is, if assumption (A) 
were affirmed) there could be no such G. Finding a function like G, 
or a proof of such behavior, is the goal of many in the foundations of 
SM. If we could find such a G we could identify it with the thermo- 
dynamic entropy, and the thermodynamic Second Law would re- 
main a strict, unexceptional law. 

Suppose we wish to describe a gas diffusing throughout its avail 
able volume into a new equilibrium state. We know that, though it 
does not matter to the definition of G, the actual gas has a mi- 
crostate X. We also know that X, whatever it is, gives rise to the 


™ See Jan von Plato, Creahug Modern Probatihty (New York: Cambridge, 1994), p. 
88, and references therein. 
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macrostate M we see before us. These are merely the assumptions 
we make when we say thermodynamics is in some sense reducible to 
mechanics. They are completely uncontroversial. Surely, the gas 
has a microstate, and surely whatever microstate it occupies corre- 
sponds to the macrostate we see. Now, if we make one more meager 
assumption, the irrelevance of Gis assured. That assumption is that 
classical or quantum mechanics always govern the evolution of X. 
For Poincare’s recurrence theorem implies that for almost all mi- 
crostates, at some subsequent time in their evolution they will return 
either to X or to an arbitrarily small neighborhood A, around X. 
Since this neighborhood can be arbitrarily small, the difference be- 
tween Xand X + A, will not matter to the macroscopic description 
of the state. Xand X + A, will correspond to the same M, whatever 
it is. 

We can now exhibit the problem with the proposed identification. 
The behavior of our function G supposedly accounts for the Second 
Law, even though this explanation is compatible with the real mi- 
crostate of the gas being one that subsequently leads to a macrostate 
that the system nearly already occufned. G will continue to increase, or 
remain constant, even if the direction of the thermodynamic en- 
tropy turns around. How do we know the thermodynamic entropy 
would turn around in such a circumstance? We know this because 
the thermodynamic entropy is a function of merely the endpoints of 
the evolution and of the macrostate of the system. Let S, be the 
thermodynamic entropy of macrostate M,, which corresponds to the 
gas confined to a small section of a container. Let S, be the thermo- 
dynamic entropy of macrostate M,, which corresponds to the subse- 
quent state of the gas after it has spread evenly throughout its new 
volume. The thermodynamic entropy, whatever it is, is given by AS, 
= $,(M,) -S (M,). But when the system recurs to an earlier 
macrostate-in this case from M, to M,-its thermodynamic entropy, 
whatever it is, is given by AS, = S,(M,) - S,(M,). The changes AS, 
and AS, cannot be of the same sign. The evolution of the thermody 
namic entropy therefore will change direction during recurrence. 
This is obvious, especially when we consider a film of gas diffusing 
run backward, but it helps to go through the reasoning to expose 
the uncontroversial premises of the argument. One might object 
and point out that the thermodynamic entropy does not in fact de- 
crease. Yet that misses the point if the system in question is a me- 
chanical system it can recur (and will recur, given enough time). It is 
physically possible, maybe even actual, that the value of the thermo- 
dynamic entropy not even approximately match the value of G. 
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Let us now reconsider the mechanical explanation of the gas re- 
laxing to equilibrium provided by function G. Apparently, its expla- 
nation of why the system approaches equilibrium will “work” even if 
the diffused gas suddenly starts to concentrate in one small corner of 
the container! This is the upshot of avoiding the recurrence theo- 
rem. Any theory that explains entropy increase with a function such 
as G will be forced to say its SM entropy is increasing, or remaining 
constant, even while its thermodynamic entropy decreases. The only 
other option is to give up one of the meager assumptions that cre- 
ated this predicament. But since these are so minimal, to abandon 
one of these is to give up the game. 

Clearly, even without going into all the gory details concerning 
the many uses of the term ‘reduction’, we cannot say that the ther- 
modynamic entropy reduces to Gin any sense.” The entropy to be 
reduced does not supervene upon the reducing entropy, yet superve- 
nience is usually considered a minimal condition for any reduction. 
Lacking supervenience, it is hard to imagine how the other meta- 
physical relationships philosophers sometimes desire of reduction 
could hold. We cannot view G as caustng the thermodynamic en- 
tropy’s behavior, since the two do not even covary. Nor can G be a 
posteriori identified with the thermodynamic entropy. There are 
nomologically possible worlds where the two entropies do not have 
the same behavior. Since any scientific identification holds across all 
physically (and perhaps metaphysically) possible worlds, the fact that 
there is a possible world where the two entropies differ implies they 
cannot be identical. So on any theory of reduction remotely 
Nagelian in spirit, where one looks for a correspondence (even ap- 
proximate correspondence) between a concept at the reducing level 
and a concept at the reduced level, we do not have a reduction. This 
is the price of making G’s evolution independent of the individual 
system's evolution: G becomes irrelevant to the individual system. 
The drive to find a function that behaves like G, we see, runs directly 
contrary to the desire to have a mechanical reduction of the thermo- 


dynamic entropy. 

V BOLTZMANN REVISITED 
SM need not and should not search for a mechanical counterpart 
for entropy that exhibits monotonic behavior with time. For when 
we find such an entropy, we immediately know that it is irrelevant to 
the behavior of real individual systems in the world. If this is right 
how should we react to this conclusion? 


” For some of the gory details, see M. Spector, Concepts of Reduchon m Physical 
Saence (Philadelphia: Temple, 1978) and references therein 
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One reaction might be to take it (either happily or sadly, depend- 
ing on one’s other views in philosophy of science) as announcing 
that we cannot have a SM counterpart of the thermodynamic en- 
tropy. The thermodynamic entropy, then, would be viewed as a con- 
cept more like ‘chair’ than like ‘carbon’. One might then either 
adopt a nonreductivist or an eliminitivist stance toward the thermo- 
dynamic entropy. The options and moves available here are familiar 
from the philosophy of mind literature on folk-psychological terms. 

I think it is a bit early, however, to give up the search for a SM 
counterpart of the thermodynamic entropy. The thermodynamic. 
entropy is a real (indirectly) observable property of individual sys- 
tems. That the major Gibbsian approaches to SM have not found it 
does not imply a mechanical counterpart does not exist. Rather, if 
we cannot find it in these approaches, then we ought to look else- 
where. 

One entropy worthy of more investigation is the unaltered Gibbsian 
fine-grained entropy Sz. We saw that the conservation of Srg is not 
the problem many believe it to be. Although there remain questions 
about it, I think it premature to close the door on Srg from the per- 
spective of reduction. 

Another entropy worthy of much more thought is the largely ne- 
glected Boltzmann entropy-in particular, the reconstructed, post-H- 
Theorem entropy recently championed by Lebowitz (op. at) and 
others. The philosophical literature on SM almost completely ig- 
nores this entropy. And despite a few articles articulating the theory, 
I believe it is fair to say that many prominent physicists remain con- 
fused about this approach. Certainly, the letters replying to the 
Lebowitz article make this abundantly clear.* The reason for the ne- 
glect lies in a complicated web of different influences. Boltzmann’s 
later theory is often confused with his earlier, infamous H+heorem.” 
Moreover, his own writing is difficult to follow and his reasoning is 
often convoluted. When combined with several common philosoph- 
ical mistakes regarding Zermelo and Loschmidt’s objections to Boltz- 
mann, and the generally confused state of the field, such 
misunderstanding is not terribly surprising. In any case, though I 
cannot guarantee that this approach is problem free or even well de- 
veloped, I believe it represents a promising candidate for under- 


™ Physics Today (November 1994): 113-15. 

= Jaynes shows that the H-function differs (nonnegligibly) from Syo whenever 
the potential energy of a system is relevant (that is, when there are interactions). 
Consequently, since Syo agrees with 5, at equilibrium, the H-function differs (non- 
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standing why mechanical systems tend to approach thermal equilib- 
rium. Due to its neglect and the confusion surrounding it, I believe 
it is worth describing the position in greater detail than I did in sec- 
tion N. I would like to do what I can in this section to shed some 
light on potentially confusing points. I shall not claim that the the- 
ory yet offers a satisfactory reduction of the thermodynamic entropy. 
Although I believe it to be more promising than mainstream Gibb- 
sian approaches, I shall show that it still faces some challenges. 

What kind of behavior should we expect from Sẹ Unlike Srg, Sp is 
not an invariant of the motion, despite Liouville’s theorem. To see 
this, consider the volume of macrostate MinT at T= 0,/T'y|. I shall 
represent the time development of the points in| Ty] at T=tas|T yl. 
Liouville’s theorem implies that |T| = |T u'l. But this does not imply 
that the volume of the new macrostate M* in T at T=¢% IT el, is equal 
to IT yl. Indeed, because most of the states in Tél will not usually 
have evolved from the relatively tiny region |T ul, |T u'l is not only un 
equal to |I we'l, but it is typically not even close to as large as |T u.l. Thus, 
Sp is free to vary with time. 

Furthermore, it is overwhelmingly likely to increase. One can see 
this qualitatively as follows. Consider a gas in equilibrium confined 
to a small portion of a box. It will correspond to a certain volume 
Iul. Now, when we remove the partition holding the gas inside that 
small section, the values of the macroscopic variables will change, 
opening up a large new volume in T available to the system. Since 
the new equilibrium distribution occupies almost all of the newly 
available phase space, almost all of the microstates originating in the 
smaller volume will evolve into this region |T,.| of the larger phase 
space. Once the representative phase point of the system evolves 
into this larger region, it stands virtually no chance of evolving back- 
ward. Since S, is the relative measure of the amount of I’ corre- 
sponding to a macrostate, we can see that S, will increase when the 
volume corresponding to M increases from |I yl tolul. Notice also 
that this explanation, unlike many in SM, works only for large N, 
which is reassuring. J.C. Maxwell” expressed the essence of this ex- 
planation when he wrote: “The Second Law has the same degree of 
truth as the statement that if you throw a tumblerful of water into 
the sea you cannot get the same tumberful of water out again” (ibid, 
p. 583). 

We need one more assumption before our sketch of Boltzmann’s 
theory is complete; for as the reader has probably noticed, I have not 


™ PM. Harman, ed, Tha Sasnizjic Letters and Papers of J.C. Maxwell, Volums IT: 
1862-1873 (New York: Cambndge, 1995). 
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yet addressed Sklar’s “parity of reasoning” problem. It is still true 
that for typical microstates S, will increase both toward the future 
and toward the past from present nonequilibrium states. That is, 
given a nonequilibrium state, we predict that $, will increase toward 
the future and that the system evolved from a system with higher Sy- 
The solution to this problem, as Boltzmann and many other lum? 
naries saw, lay in appealing to temporally asymmetric boundary con- 
ditions. Richard Feynman” notes this clearly: “it is necessary to add 
to the physical laws the hypothesis that in the past the universe was 
more ordered, in the technical sense, than it is today” (thd, p. 116). 
By assuming that what we call the earlier states of the universe are of 
comparatively low entropy with respect to what we call the later 
states, we remove the unwelcome retrodiction. Earlier states do not 
have higher entropy than present states simply because we assume 
that sometime shortly after the Big Bang entropy was almost unimag- 
inably low. Following Albert, let us call this cosmological hypothesis 


the past hypothesis: 


Past hypothesis: the initial (or very early) macrostate of the universe 
is one of extraordinarily low entropy. 


The past hypothesis solves the problem of the direction of time. 
Whether it adequately explains the direction of time is another ques- 
tion (see my op. at). It also tells us that we can live with recurrence. 
Given that the universe currently resides so far from equilibrium, it 
is overwhelmingly probable that entropy did and will increase for all 
relevant observation times. 

Before turning to two worries about the Boltzmann approach, let 
me try to dispel the idea that there is anything unpleasantly subjec- 
tive about S,. The coarse graining used to define S, does not imply 
that S, is subjective, only that it is relational. The thermodynamic 
variables are used to characterize the macrostates of interest. Which 
microstates give rise to a given thermodynamic state is an objective 
matter. It does not rely on the precision with which we can measure 
a state or our knowledge of a state. If a Laplacian intelligence in- 
formed you of the exact microstate of some macrostate, that would 
not affect the value of Sẹ one jot. You would have more information 
about the actual microstate of the system than S, could provide, but 
S, is completely independent of how much you know. This is true 
despite S,’s being relational. As Jaynes (op. at) points out, a crystal 
of salt may have two different entropies: one expressed as a function 


© The Character of Physical Law (Cambndge. MIT, 1965). 
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of temperature, pressure and volume, S(T, P), another expressed, 
say, as a function of temperature, the strain tensor and electric polar- 
isation, S(T,e E). But this does not imply, as Jaynes thought, that en- 
tropy is anthropocentric in nature. It implies merely that 
Boltzmann’s entropy, to its credit, reproduces an ambiguity already 
existing in thermodynamics. How many microstates correspond to a 
particular thermodynamic description is still an objective matter, 
even if a system admits of more than one such description.” 

S, seems to have the credentials we want in a reduction basis for 
the thermodynamic entropy. It is an objective function of the state 
of individual systems. The actual microstate of a gas matters to the 
Boltzmannian explanation of thermodynamic behavior, in the sense 
that we can tell whether an individual system is in equilibrium or 
not. Further, the entropy behaves the right way. For these reasons, 
and some other mathematical features (for example, its additivity), 
Spa seems very promising.” Before we get too excited, however, let 
me finish with three questions about 5, which are not often dis- 
cussed. 

First, for the Boltzmannian approach to work the actual mi- 
crostate needs to be a generic or typical one for M (in the language 
of Lebowitz). We know from the argument of section I that we can- 
not expect entropy increase from all microstates all of the ime. We 
also know, however, that “most” microstates will tend toward equilib- 
rium, since the size of the regions deviating from thermal equilib- 
rium (according to Lebesgue measure) becomes negligible for large 
N. As N > œ, the proportion of these typical states in [ goes to one. 
But since N#-o for real systems, the set of atypical points has positive 
measure. What justification do we have for ignoring microstates in 
this set? And are we not just begging the question if we assume the 
actual microstate is always in the set of typical points? Have we really 
accomplished anything? (This problem is similiar in some respects 
to the so-called “measure-zero” objection to ergodic theory—only 
here the atypical set is not measure zero.) 

It is important not to overreact to this problem. The worry is es- 
sentially asking for an independent justification of the imposition of 
our “natural” probability metric on T, which is something we do not 
have. But we can only try to solve one problem at a time, and any 


= Another potental source of subjectivity lay in the fact that to define S, we 
must “discretize” I’ into small cells So long as the number of cells grows slowly 
with respect to N, however, this will not affect the behavior of Sp 

» It is worth noting that we can extend this approach to quantum mechanics. 
The volume corresponding to a macrostate has a natural analogue in Hilbert 
space, namely, the dumension of the projector on the macrostate. 
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way, the problem of justifying the “natural” probability metric is a 
very large one common to all of the different approaches to SM.™ 
Indeed, it is a problem wherever we use probabilities, even outside 
physics (for example, in games of chance). But, surely, it is interest- 
ing and striking to discover that according to a very natural measure, 
Lebesgue measure, nearly all of the initial conditions compatible with 
a macrostate are such that Hamiltonian evolution will take them to 
states closer to thermal equilibrium. And, surely, it is interesting and 
striking (for instance) that it can be rigorously shown that this domi 
nant set of microstates will, in fact, have their time evolution approx- 
imated by generalizations such as the Boltzmann equation.” To be 
sure, the typicality assumption is somewhat undesirable, but given 
the lesson of the reversibility and recurrence objections, I do not 
know how we can escape ultimately making such an assumption with- 
out changing the dynamical laws. 

Furthermore, intuitively, the evolution of atypical microstates is 
very contrived, as these states systematically avoid evolving into the 
larger compartments of I. One hopes that some mild dynamical 
constraints that will prohibit this “contrived” behavior. Computer 
models and toy models such as the Kac ring model already suggest 
that these conditions will be weaker than ergodicity or mixing. De- 
velopment of the Boltzmannian approach should include a state- 
ment of precisely what dynamical constraints one needs. 

This point raises my second question: How should we understand 
the “accessible regions Ire corresponding to X? After all, if our 
system is not ergodic (metrically transitive), then I° will decompose into 
subsets with boundaries that cannot be crossed by X. In what sense, 
then, is |T „| accessible to X, if T,| overlaps one or more of these 
subsets? As Uffink expressed the problem to me, what dynamical rel 
evance does the relative size of |l „| have? One partial answer might 
be that the compartments of [ representing equilibrium are so large 
that, even if X is trapped in some subset of I’, still within that subset 
“most” of the states are ones either representing equilibrium or 
states closer to equilibrium. Another hope might be that the dynam- 
ical constraints mentioned in the above paragraph will show that 
most of |I q] is, in fact, accessible to X. 

Finally, and more philosophically, it is commonly thought that 
one theory reduces to another only if the reducing theory in some 


» One potential sohition to this problem is found in the GRW mterpretaton of 
quantum mechanics-see Albert. 

"n See Oscar Lanford M, “The Hard-Sphere Gas in the Boltzmann-Grad Limit,” 
Physica A, CVI, 70 (1981): 70-76. 
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sense explains the potentially reduced theory. If we understand ‘ex- 
plain’ as ‘causally explain’, for instance, can S, play a role in a causal 
explanation of why systems tend toward thermal equilibrium? While 
playing such a role may not be necessary for the success of Sp, cer- 
tainly it would be attractive. One reason to think it cannot fill this 
role is that $, essentially measures the number of microstates the sys- 
tem is not m but could have been without us noticing. As this num- 
ber does not say much about the nature of X itself, at first glance it 
hardly recommends itself as the sort of thing useful to a causal expla- 
nation. But as Peter Railton” points out: “The stability of an out- 
come of a causal process in spite of significant variation in initial 
conditions can be informative...in the same way it is informative to 
learn, regarding a given causal explanation of the First World War, 
that a world war would have come about...even if no bomb had ex- 
ploded in Sarajevo” (ibid., p. 251). To be sure, it would be wrong to 
think that the number of states X is not “drives” X toward equilib- 
rium. But finding out about the (typical) “inevitability” of thermody- 
namic behavior does carry with it modal and explanatory force. Sy, 
quantifies this modal force. 

CRAIG CALLENDER 
London School of Economics 


™ “Probability, Explanation, and Information,” Synthese, xLvi (1981): 233-56. 
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ings, and the like are problematic, perhaps because there can- 
not be identity conditions for meanings unless there is an 
analytic/synthetic distinction, and there is not We are not propos- 
ing to consider whether the notion of content identity can be 
made metaphysically respectable. But we want to discuss a pro- 
posal about how to save semantics if it does turn out that “same 
meaning” cannot be reconstructed; it is one that is currently popu- 
lar in both the philosophical and the cognitive science literature.’ 
The idea is that the purposes semantic theories are supposed to 
serve do not actually require principles of content tndtviduation, all 
they require is a notion of content similanty. On such a view, similar- 
ity of meaning is primitive relative to sameness of meaning, and the 
latter might be of interest only as a limiting case of the former, per- 
haps, indeed, a limiting case that is never achieved in practice. 
Meaning similarity is real and robust, there is a lot of it around, and 
it is metaphysically prior to synonymy. So the story goes. 
We have two points to make here. First, even if a notion of con- 
tent similarity were on offer, it would arguably be of no use for doing 


I: is old news that the identity conditions for contents, mean- 


' Texts where this sort of suggestion is endorsed are ubiquitous. For some ex- 
amples: Gilbert Harman, Thowght (Princeton: University Press, 1973); Ned Block, 
“Advertisement for a Semantics for Psychology,” in P. French, T. Uehling, H. 
Wettstein, eds., Midwest Studies tn Philosophy, Volume 10: Studies tn the 
Mtnd (Minneapolis: Minnesota UP, 1986), pp. 615-78; S. Sloman and LJ. Rips, Sim- 
danty and Symbols in Human Thinking (Cambridge: MIT, 1998); Stephen Such, “On 
the Ascripnon of Content,” in A. Woodfield, ed., Thought and Obd (New York: Ox- 
ford, 1981), pp. 153-206; Ray Jackendoff, “Conceptual Semantics,” ın U. Eco, M. 
Santambrogpio, P. Viol, eds., Meantng and Meantng Representations (Bloomington: 
Indiana UP, 1988), pp. 81-97. 
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the kinds of things that a theory of meaning ought to do. Quite gen- 
erally, the kinds of explanations that semantic theories are supposed 
to give would not survive substituting a similarity-based notion of 
content for an identity-based notion. This is unaccidental; it is the 
result of ways in which content similarity and content identity are in- 
trinsically different kinds of relations. Second, we shall argue that, 
even if a similarity-based notion of content could in principle re- 
place an identity-based notion salve the explanatory power of seman- 
tic theories, that would not be much cause for rejoicing. For, asa 
matter of fact, nobody has the slightest idea how to construct the re- 
quired notion of content similarity; and the reasons for this are 
much the same, mutatis mutandis, as the reasons why nobody has 
the slightest idea how to construct a perspicuous notion of content 
identity. Until recently, we would have thought this claim was unten- 
dentious. But Paul M. Churchland,’ in a number of articles over the 
last decade, has proposed what he says is a schema for such a con- 
struction. That he is quite wrong to say this has been the burden of 
several papers of ours, to which Churchland has now likewise 
replied. Our view is that his reply is an egregious ignoratio elenchi. 
The second part of the present discussion is our attempt to make 
clear why it is. 
I. WHY CONTENT SIMILARITY IS PROBABLY OF NO USE 

A question that is not asked nearly as often as it should be is this: If 
you had a tenable notion of similarity of meaning, what exactly 
would you do with it? We suspect that friends of similarity-based 
content more or less take for granted that, given such a notion, they 
could just plug it in wherever semantic theories have hitherto availed 
themselves of synonymy, content identity, intentional equivalence, 
and the like. It is (we continue to suspect) because the friends of 
meaning similarity assume this that they rarely bother to show how a 
semantics built around content similarity could do anything like the 


* “Some Reductive Strategies in Cognitive Neurobiology,” in A 

tional Perspectsoe' The Nature of Mend and the Structure of Sasnce (Cambridge: MIT, 
1989), pp. 77-110; our Hoksm: A Shopper's Guide (Cambridge: Blackwell, 1992), 
chapter 6; Churchland, “Fodor and Lepore: State-space Semantics and Meaning 
Holism,” Philosophy and Phenomenological Research, LI (1993): 667-72; our “Reply to 
Churchland,” Philosophy and Phenomenological Research, 110 (1993): 679-83; Church- 
land, “Second Reply to Fodor and Lepore,” in R McCauley, ed., The Churchlands 
and Ther Critics (Cambridge: Blackwell, 1996), pp. 278-83, Churchland, “Concep- 
tual Similarity across Sensory and Neural Diversity The Fodor/Lepore Challenge 
Answered,” this JOURNAL, XCV, 1 ( January 1998): 5-32. 
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explanatory work that semantic theories are supposed to do.’ In 
fact, a plausible prima facie case can be made against this assump- 
tion; so, anyhow, we are about to argue. 

We can think of four main purposes for which one might want a 
theory of meaning and for which such theories have historically been 
employed. We do not deny that there may be other uses to which 
they could be put; but we do think it is plausible that, if a theory of 
meaning failed all of the following, it would not be worth having. 


1 Satsfacthon conditions. A semantc theory for a language ought to as 
sign satisfaction conditions to its semantically evaluable expressions; 
likewise, mutatis mutandis, for a theory of the intentional content of 
semantically evaluable mental states.‘ Moreover, it should turn out 
that sentences that we are all quite certain are true are true under 
the preferred assignments of satisfaction conditions. For example, 
ceteris paribus there would be something badly wrong with a seman- 
tic theory that assigned truth conditions to ‘Sparrows are birds’, “Wa- 
ter is wet’, ‘George Washington was president’, and so on, according 
to which any of them expresses a falsehood. 

2. Composttionahty, A semantic theory should make clear how the satis- 
faction conditions for complex linguistic expressions are determined 
by the satisfaction conditions of their grammatical parts, thereby ex- 
plaining why natural languages are typically productive and system- 
atic. Likewise, a semantics for propositional attitudes should make 
clear how the satisfaction conditions for beliefs, desires, and the like 
are determined by those of their constituent concepts, thereby ex- 
plaining why minds are typically productive and systematic. 

3. Translation. Pretheoretic intuition has it that meaning is what good 
translations preserve. A semantic theory should provide a notion of 
meaning according to which this turns out true. 

4. Intenhonal explanation. A semantic theory should reconstruct a no- 
tion of content that is adequate to the purposes of intentional (for 
example, belief/desire) explanation 


We are about to argue that, quite likely, a similarity-based theory of 
meaning would not do any of these four kinds of things; not even if 
one assumes, qua semantic realist, that an identitybased theory of 


* Arguing for this assumption is a burden which semantc aeunatrusis, unlike 
the friends of content mmilarity, are of course not required to discharge. There 
are people around, including Churchland himself in some of his moods, who ap- 
parently cannot decide whether their metaphysical bottom line us realism about 
content similarity or skepticism about meaning. Our discusmon is only concerned 
with the former doctrine. 

* We assume that linguistic expressions and propositional attitudes are both se- 
mantically evaluable; and we are neutral as to which, if either, is ontologically 
prior We shall generally go back and forth between the semantics of languages 
and the semantics of mental states as expository convenience suggests. 
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meaning would do the work if only the individuation problem for 
contents could be solved. 

LI. Satisfaction conditions. Consider, to begin with, any statement 
(or sentence or thought) you like which you are prepared to believe 
is atomic, has the logical form Fa, and is true. Nothing much except 
ease of exposition turns on how notions like ‘logical form’ and 
‘atomic statement’ are deployed; you choose. (In fact, it does not 
even matter all that much how you construe the notion ‘true’; we 
shall return to this.) Take, for example, the sentence (S1): 


S1. Nixon is dead. 
By our lights, 

(i) (S1) is certainly true. 

(ui) ‘Nixon 1s dead (at t)’ is true if and only if Nixon is dead (at t). 

(iii) No semantic theory (for English) could be adequate unless it en- 

tails (ii) and is consistent with (i) 
Query: What would a content-similarity based semantics say about 
the truth conditions of (S1)? 

Well, we do not know because, of course, there are not really any 
such theories on offer. But we guess the story would go something 
like this. Think how an old-fashioned, identity-based semantic the- 
ory goes about entailing (ii). Roughly, it assigns the individual 
Nixon to the singular term ‘Nixon’; it assigns the property of being 
dead to the expression ‘is dead’, and it interprets sentences of the 
logical form Fa to mean that the individual designated by ‘a’ has the 
property expressed by ‘F’. Well, likewise, mutatis mutandis, accord- 
ing to the new dispensation; except that, for example, we assign to 
the expression ‘is dead’ not the property of being dead, but a range 
of properties all of which are similar to being dead, and we assign to 
‘Nixon’ a range of individuals all of whom are similar to Nixon.’ 

But, clearly, to proceed in this fashion is simply to give up the 
hope of assigning satisfaction conditions to formulas in a way that 
makes, for example, sentences like (ii) come out true. For example, 
we suppose that being comatose on one’s death bed is pretty similar 
to being dead; but it is not the case that, if Nixon is comatose on his 


* From here on, we ignore issues about context senmtivity, in particular, we shall 
not worry about implicit indexicals. 

* Perhaps, however, names express individual concepts, in which case ‘Nixon’ is 
amigned some property mmilar to the one that ‘Nixon’ expresses. We are not at all 
clear what a mmilarity-based semantics should say about singular terms. Since the 
options for predicates seem a bit clearer, from here on we shall generally run the 
discusion on them 
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death bed at ¢ then Nixon is dead at t If you do not believe us, ask 
Doctor Kevorkian. 

We take it to be just self-evident that properties constructed out of 
semantic similarity will not do what notions like ‘expresses’, 
‘denotes’, ‘means’, and so on are supposed to do in statements like 
‘Nixon’ denotes Nixon, ‘is dead’ expresses the property of being 
dead, ‘is dead’ means is dead, and the like. The problem is simple 
enough: if two expressions denote, express, or mean the same, then 
they are coextensive; but if two expressions only denote, express, or 
mean something similar, they need not be (and typically are not) co- 
extensive. And, it is at the heart of the standard way of assigning sat- 
isfaction conditions, that what replaces ‘F’ on the left-hand side of 
instances of the schema ‘Fa is true if and only if a is F’ must be at 
least coextensive with what replaces it on the right-hand side. The 
moral seems to be that part of the price of switching from a content- 
identity based semantics to a content-imilarity based semantics is 


giving up the idea that theories of meaning specify the pretheorett- 
cally correct conditions of semantic evaluation of thoughts, sen- 
tences, and the like.’ 

A last consideration under this general head: you might suppose 
that you could use a similarity-based semantics to assign satisfaction 
conditions in something like the traditional way if you are prepared 
to give up an absolute notion of fruth in favor of some correspond- 


” We are assuming that the project 1s to replace a semantics based on inten- 
sional identty with one based on intensional similarity; but we are also 
that the usual noton of identity of extension survives. That is, it is not part of the 
story that the revisionist semantics would appeal to some notion of similarity of ex- 
tension where traditional semantics adverts to extenmonal identity. (As far as we 
know, none of the proposals for a similanty-based reconstruction of the notion of 
intenson on offer in the literature does call for a similarity-based reconstruction 
of the notion of extension: either the issue is not discussed, or the conservative 
notion of extension is assumed. See, for example, Hilary Putnam, “The Meaning 
of ‘Meaning’,” in Mind, Language, and Reahty (New York: Cambridge, 1975), pp. 
215-71.) 

In fact, we cannot imagine how a mmilarity-based notion of extension would go. 
The problem is that the similanty of two extensions would presumably have to be 
grounded not in their dagres of overlap, but in the similarity of the rndrorduals that be- 
long to their extensions, and who knows how this notion is to be explicated? Thus, 
for example, the extension of ‘male college senior’ 15, PaE “more similar” 
to the extension of ‘female college senior’ than it is to the extension of ‘rock’ or of 
‘prime number’ But none of these sets overlaps at all. By contrast, the set {male 
truck drivers} is less similar to the set {male college students} than it is to the set {fe- 
male college students}, though the first and second probably overlap and the sec- 
ond and third do not. 

We shall, however, briefly consider a case of a mmilarity-based semantic theory 
which proposes to dispense not only with identity of meaning but also with identity 
of truth value. See immediately below 
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ingly approximate notion. On this view, for example, the right truth 
sentence for (S1) might be not (ii) but (iv). 


(tv) ‘Nixon is dead’ is stwalar to being true (that is, it is roughly true) if 
and only if Nixon is similar to being dead. 


We do not actually know of anybody into meaning similarity who 
clearly endorses this, nor are we sure what, exactly, such a proposal 
would amount to. But whatever (iv) means, there are surely coun- 
terinstances to the schema formulas of the form Fa are roughly true if 
and only if a ts similar to F. The basic problem is that ‘similar to’, un- 
like ‘roughly true’, is, of necessity, relativized to respects. Thus, it is 
true (we suppose) that dogs are similar to cats; but it is not even 
roughly true that dogs are cats; or that our cat is a dog,...and so on. 
The equivalence also fails in the other direction. It is roughly true 
that you cannot be the Pope if you were born in the Bronx. But be- 
ing the Pope and not being born in the Bronx are not at all the 
same sort of thing. Both of the present authors are instances to the 
contrary. 

I.2. Composthonakty. Compositionality is the idea that, in the typi- 
cal case, when syntactically complex expressions (/thoughts) are se- 
mantically evaluable, their satisfaction conditions are determined by 
the satisfaction conditions of their syntactic constituents. And it is 
generally agreed that the crucial condition that must be satisfied in 
order that compositionality should hold for a class of expressions is 
that the satisfaction conditions of their constituents should be con- 
text independent. So, the fact that ‘brown dog’, ‘green dog’, “brown 
cat’, and ‘yellow cat’ are all compositional in English is part and 
parcel of the facts that: ‘brown’ means the same in the environment 
‘_ dog’ that it does in the environment ‘__cat’; ‘dog’ means the 
same in the environment ‘brown...’ that it does in the environment 
‘green...’; and so on. Likewise, mutatis mutandis, for ‘brown Poodle 
dog’ and the like. That compositionality requires context indepen- 
dence is, to repeat, the consensus view, and we simply take it for 
granted in what follows. 

Now, the claim that the meaning of a constituent is context inde- 
pendent is essentially the claim that schema (S2) is valid 


S2. If mis part of the meaning of ‘a’ and ‘a’ is a constituent of ‘b’, then 
mis part of the meaning of ‘0’. 


* Reading ‘part’ as ‘improper part’. So, in particular, the whole meaning of a 
syntactically primitive expression counts as ‘part of’ its meaning and as part of the 
meaning of any complex expression of which it 8 a constituent 
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The point of present interest, however, is that (S3)—which is what 
you get if you substitute ‘is similar to part of the meaning of’ for ‘is 
part of the meaning of’ throughout (S2)—is surely false: 


S3. If m is mmilar to part of the meaning of ‘a’ and ‘a’ 1s a constituent 
of ‘b’, then mis similar to part of the meaning of ‘b’. 


So, presumably, the meaning of ‘cat’ is similar to part of the mean- 
ing of “The leopard is on the mat’ (cats and leopards are both fe- 
lines) and part of the meaning of “The leopard is on the mat’ is 
similar to part of the meaning of “The explosion caused a lot of dam- 
age’ (leopards and explosions are both dangerous). But the mean- 
ing of ‘cat’ is not similar to any part of the meaning of ‘The 
explosion caused a lot of damage’. 

The main argument is, we think, simple and obvious: the standard ex- 
plication of the compositionality of content depends crucially on assum- 
ing that the semantic property of a constituent which is preserved when 
it is embedded in its host is content identity, not just content similarity. 

L3. Translation. The relation translates, like the relation ‘is similar 
to the meaning of’, is plausibly intransitive. For this reason, lots of 
friends of content similarity think that, whatever its flaws may be, it is 
at least likely to be useful in the theory of translation. And, indeed, 
there are some who think this is the oniy work a semantic theory can 
legitimately be asked to do because facts about translation are the 
only bona fide semantic facts there are. 

We think, however, that this line of thought is extremely dubi- 
ous, at least for the case of infinite languages (namely, languages of 
infinite expressive power), which we suppose to be the only case 
worth caring about. Suppose that a translation theory between L, 
and L, is a finitely specifiable, computable function from the sen- 
tences of one to the sentences of the other, under which similarity 
of meaning is preserved. Then, we think there cannot be such a 
theory unless L, and L, are compositional. The basic idea is pretty 
obvious. Suppose the languages are infinite and the translation 
theory is finite. Then presumably the translation theory works by 
first listing the translation relations among the primitive bases of 
the two languages (it says that ‘chat’ means the same as ‘cat’ and 
that ‘chien’ means the same as ‘dog’, and so forth) and then pro- 
viding recursive procedures for constructing the translation rela- 
tions among the infinitely many complex expressions. But, we 
suppose, this will work only where, in each of the two languages, 
the semantics of the complex expressions is determined in a regu- 
lar way by the semantics of the primitive basis; that is, if the lan- 
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guages are semantically compositional.’ If this line of thought is 
right, then the arguments that hold against similarity-based compo- 
sitional theories (see above) are also arguments against similarity- 
based translation theories. 

1.4. Intentional explanations. Paradigm intentional explanations ap- 
peal to principles that control the relations (rational, or causal, or 
both) between propositional attitudes with identical or overlapping 
contents. Thus, the ‘practical syllogism’ is supposed to be captured 
by some such schema as (S4): 


S4. If a(gent) desires that P and believes that not-P unless Q (where a 
believes that Q is contingent on an action that he believes himself 
able to perform), then, ceteris paribus, a tries to bring it about 
that Q. 


Prima facie, (S4) is sound only if the formulas that substitute for P 
are identical or synonymous throughout. (It is plausible that weaker 
relations like, for example, logical equivalence are insufficient even 
when they preserve the extensions or the relata.) Our point is that 
(S4) is patently unsound if substitutions for the schematic variables 


preserve only stmilanty of meaning. 

Being a zerkon is similar to being a diamond; zerkons are manu- 
factured to ensure that this is so. But Marilyn Monroe did not think 
that zerkons are a girl’s best friend, and she was right not to think so. 
Likewise, Chateau Rothschild is not all that different from Chateau 
Plonk; but there are many who are prepared to pay for the one what 


* But what if translation turns out to be a syntactic relanon? That is, what if 
the translation of L, expressions into L, expressions is fully determined given 
their respective syntactic structural descriptions (including, of course, their 
lexical inventories; see, for example, Stephen Schiffer, Remnants of Meantng 
(Cambridge: MIT, 1987); Fodor, A Theory of Content Aiea MIT, 1990), 
chapter 7; Lepore, “Conditions on Understanding Language,” Anstotetian Socr 
ety XCVI, 1 (1996): 41-60). This is perfectly all right with us, but no- 
tice that it is no help to somebody who says that translations ipso facto preserve 
sirmilarity of meaning. The claim that syntax determines translation is not at all 
the same as claiming that syntax is what (good) translation preserves; what trans- 
lation preserves, according to simularity-based semantics, is similarity of con- 
tent. Well, if it is to do so in the case of translation between productive 
languages, infinitely many meaning sumilanty relations among complex expres- 
sions must all be projected from finitely many meaning similarity relations 
among primitive expressions. But, as we remarked in the text, it looks like 
meaning similarity is generally not preserved under the recursive processes that 
construct complex expressions from simpler ones; in general, the semantic 
property such processes can preserve is content idenisty. If this is right, a syntac- 
fic translation procedure could not be expected to preserve the semanie proper- 
ties of expressions in the languages between which it operates if both 
languages are productive and the only semantic property defined for their ex- 
pressions is meaning similarity. 
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they would not pay for the other. Likewise, you may have a strong 
preference as to which twin you marry, even if the similarity between 
the twins is notable. And so on. None of these asymmetries of pref- 
erence need indicate a palate that is excessively refined. To the con- 
trary, the mediation of responses that discriminate sharply between 
similar situations is a fair part of what cognition is for. A landscape 
without a ravening tiger may be quite similar to a landscape with 
one. Tigers make a living out of that; it is part of the story about why 
they have stripes. It may thus matter a lot which of two quite similar 
landscapes you are in. Your cognitive mechanisms are there in large 
part to help you to find out. 

The underlying principle in all this is entirely obvious: 

If x and y are identical, then they are identical tout court. But if x 

and yare (merely) similar, then there is always a respect in which x and 

y are similar, and it can matter to their rational or causal relations 

what respect this is. 
Here is another way to make much the same point. Presumably, 
F= GU (F= G). But, of course, the corresponding condi- 
tional for similarity is invalid. The result is that the counterfactu- 
als that identity statements guarantee are not, in general, 
consequences of the corresponding similarity statements. We as- 
sume that ‘P explains Q’ is true only where ‘if P, then Q’ does 
support appropriate counterfactuals. (Likewise, mutatis mutan- 
dis, for ‘P confirms Q’.) All this being so, it is hardly surprising 
that, in general, you cannot replace content identity with content 
similarity in an intentional theory salve explanatory power. There 
may be some way for a similarity-based semantics to save the kind 
of intentional explanations that the practical syllogism provides; 
but we do not ourselves know how; and we have not seen any sug- 
gestions in the literature. 

We end this section by remarking that it is our guess that noth- 
ing we have said so far would move Churchland in the least. 
What he would reply, we continue to guess, is that we have failed 
to realize how very radical his revisionist proposals actually are: 
they are intended to abstract not only from the traditional idea 
that the central semantic construct is content identity, but also 
from the traditional inventory of problems that semantic theo- 
ries are supposed to solve. Well, so be it. But we, in turn, are 
not moved to exchange the old way of doing semantics for a new 
way on the grounds that the latter, though it will not solve what 
used to be thought the main problems about meaning, does 
promise to solve some new problems that it is unfortunately not 
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Figure 1: A rudimentary face space of three dimensions” 


now possible to enumerate. A pig in a poke is what it sounds like 
to us. 
I. WHY CHURCHLAND HAS NO NOTION OF CONTENT SIMILARITY 


Thus far our topic has been what role a similarity-based notion of 
meaning might (or might not) be able to play in a semantic theory. 
Suppose, for the sake of the argument, that our doubts about this 
are misplaced and that an account of meaning similarity would be 
a very good thing to have. We turn to our second topic, which is 
why we do not think Churchland has got one. 

This requires a bit of background. Let us start with the notion of a 
semantic vector space. The basic idea is illustrated by Figure 1 above. 
This figure is supposed to provide a very simplified example of a 
kind of representational system that might reconstruct the percep- 
tual similarity of faces. Roughly, there is an n-dimensional space in 
which each representation of a face occupies a proprietary region. 


After Churchland, The Engrs of Reason. The Seat of the Soul (Cambridge: MIT, 
1995), p. 28, his figure 2.5. Reproduced with permission of the author. 
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The dimensions correspond to properties of the faces that are 
salient for the perception of similarity (and/or for similarity judg- 
ments). In the present case, the space is assumed to be three di- 
mensional for expository simplicity, “but this is highly unrealistic 
—our coding space for faces probably has at least twenty dimen- 
sions—but it does evoke the wide range of faces one can discrimi- 
nately code with only a few resources.” | The intended 
interpretation of the notation is, in effect, that locating a given face 
Fat a given position Pon a given dimension D corresponds exactly 
to representing Fas having the property D to the extent P. The 
empirical claim is that, given the right choice of dimensions, the 
propinquity of the items in a space predicts their perceived or 
judged similarity.” 
Several quick points about this kind of representation: 


(1) It does not matter that, in the example, a region of the space is 
identified with a representation (/concept) of a particular face. 
Any representation, either of an individual or of a property, would 
do equally well, so long as there 1s a natural and “psychologically 
real” choice of salient dimensions along which to estimate the rela- 
tive similarity of the things it represents. 

(2) In fact, to claim that a concept can be properly represented as a re- 
gion in a vector space really amounts to no more than claiming that 
there does exist a natural and motivated choice of such dimen- 
sions. Looked at this way, the differences between vectorspace en 
coding and more familiar accounts of meaning is easy to 
exaggerate. For example, a vectorspace encoding is entirely com- 
patible with the traditional doctrine that concepts are “bundles” of 
semantic features. Indeed, the latter is a special case of the former, 
the difference being that, whereas semantic features are generally 
thought of as having binary values, semantic dimensions are al- 
lowed to be continuous. Unsurprisingly, in light of this, semantic 
feature theories yield putative metncs of content similarity just as 
vector-space theories do: namely, the relative similarity of two con- 


" Churchland, thd., p. 29. 

* Churchland apparently thinks that the idea of developing a geometrical sys- 
tem of representation, in which concepts, contents, and the like are identified with 
regions in a multidimensional semantic space is novel and indigenous to connec- 
tionist models of mind. “[It] contrasts sharply with the kinds of representanonal 
and processing strategies that...cognitive psychologists...have traditionally ascribed 
to us"—-A Neurocomputational Perspactroe (Cambridge: MIT, 1991), p. 171. In fact, 
scrnantic spaces have been around for quite a long while; see, for example, CE. 

GJ. Sua, and P.H. Tannenbaum, The Measurement of Meaning (Urbana: 
Hlinois UP, 1967) Most cognitive psychologists who worked with them eventually 
concluded that there is not any question to which they are the answer. We think 
they were right to conclude that. 
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cepts corresponds to a (possibly weighted) sum of the features they 
have in common. 


(3) The plausibility of identifying propinquities in a space with similar: 
ties of content depends entirely on the choice of the dimensions. 
If, for example, you change one of the dimensions in Figure 1 
above so that what it expresses is not (for example) the width of a 
face's nose but the weight of the face’s owner, the propinquities in 
that space will not predict judgments of perceptual similarity of the 
faces. This is the avatar, in the geometrical notation, of the per- 
fectly general consideration that estimates of similarity must always 
be relativized to “respects.” 
So much, then, for the system Churchland wants to employ for anno- 
tating content relations among representations. Our main claim, in 
earlier phases of the exchange with him (see footnote 2), was that 
moving to this notation requires rephrasing a number of traditional 
cruxes that arise for identity-based theories of meaning, but that 
none of them is either resolved or bypassed. We briefly recapitulate 
the objections we raised. 

As we have just seen, similarity of content is specified only with re- 
spect to a choice of the interpretation of the dimensions of a seman- 
tic space. There are three questions to be raised about such choices, 
all of which are patently counterparts to classical worries about indi- 
viduating content in identity-based meaning theories. 

II.1. How is the choice of semantic dimensions to be constrained? 
For example, what determines that we are to taxonomize faces by the 
width of the noses they contain, rather than (for example) noses by 
some relation among the faces that contain them? Or, more plausibly, 
why is it right (if it is) to taxonomize the apple prototype by reference 
to its location in a space of colors (prototypical apples are red) rather 
than the red prototype by reference to its location in a space of fruits 
(paradigm red is apple-red). This is exactly parallel to the problem 
that arises in identity-based meaning theories about what constrains 
the choice of primitive features, as in: Why is it right (if it is) to define 
‘mother of’ in terms of ‘parent of’ instead of the other way around? 

Notice that this question is just as urgent for vector theories as it is 
for feature theories. For neither offers an interpretation of similarity 
relations, or indeed, of any semantic relations, among the primitive 
features/dimensions. Rather, they purport to capture relations of 
similarity of concepts relative to the primitive features/dimensions 
(that is, similarity in respect of the properties that the primitive fea- 
tures/dimensions express). Hence, ceteris paribus, the bigger the 
inventory of primitive features/dimensions they posit, the more se- 
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mantic facts they fail to represent. In the limiting case, every concept 
corresponds to a primitive feature (that is, to a binary valued dimen- 
sion); and the theory becomes, in effect, disquotational. This sort of 
point has not, of course, gone unremarked in discussions of theories 
of meaning. The old-speak way to put it was: the notion of a seman- 
tic feature analysis of the lexicon is trivialized if there is allowed to be 
a distinct feature for each lexical item (so that the semantic analysis 
for the concept DOG says only that it is the unique concept that has 
the value plus on the dog feature). Nothing about this changes when 
you go over from binary features to continuous ones. 

We suggested, earlier in the exchange with Churchland,” that the 
reason this sort of problem seems not to bother him is that, like so 
many cognitive scientists, he believes deep down that only sensory 
properties can be semantic dimensions. This does solve the prob- 
lem, but at the cost of a recidivist and deeply implausible empiri- 
cism. If Churchland has now got a better way out, we cannot find it 
in his text. 

H.2. What are the truth makers for claims about the semantic inm- 
terpretation of the dimensions? A second problem we raised for 
Churchland (in “Reply to Churchland”) was that the only meta- 
physics of meaning he seemed to have on offer was one according to 
which the inferential roles of representations are the truth makers 
for claims about their semantic properties; and that could not be the 
right story about the semantic properties of the dimensions them- 
selves. 

As we saw above, to locate a concept (or a prototype) C with re- 
spect to a dimension D of a space is, in effect, to claim that the infer- 
ence from being C to being D is constitutive (statistically or 
definitionally) of C. But if you thus identify the content of C by ref- 
erence to its inferential relation to D, you cannot also identify the 
content of D by reference to its inferential relation to C. (You could, 
to be sure, identify the content of D with its position in some higher- 
order semantic space; but that would initiate a regress.) This is, once 
again, exactly parallel to a classic worry about identity-based mean- 
ing theories: if the contents of complex concepts are identified by 
their inferential relations to primitive concepts, the content of the 
primitive concepts has to be identified in some other way; and nobody 
knows what other way that is. 

As far as we can tell, Churchland is now clear that the notion that 
contents are somehow individuated by their inferential roles will not 


3 Holism: A Shopper's Guide. 


394 THE JOURNAL OF PHILOSOPHY 


work for the content of the dimensions. Rather, he thinks, the se- 
mantics of the dimensions is somehow to be reconstructed in terms 
of causal relations between, on the one hand, the representations 
that have values on the dimensions and, on the other, facts about 
the world. We have no objection to his taking this way out; but he 
does now need some reason for not treating such causal relations as 
the truth makers for all claims about the semantic properties of rep- 
resentations, thereby giving up, inter alia, the identification of con- 
tents with positions in semantic spaces. If (for example) the redness 
dimension has the content it does in virtue of causal relations be- 
tween red things and the mental representation RED, why should 
the mental representation DOG not have the content it does in 
virtue of its causal relation to dogs? Note the exact correspondence 
to a traditional question about identitybased meaning theories of 
the ‘two factor’ variety: If it is to be assumed that a one-factor, causal 
theory works for the semantics of the primitive expressions, why is it 
to be denied that a one-factor causal theory will work for the seman- 
tics of all the expressions? 

if.3. Strictly speaking, the vector theory identifies similarity of con- 
tent only with respect to concepts that have rankings on the same di- 
mensions. But it is implausible that this is true of the concepts people 
actually entertain. This raises a problem for similarity-based meaning 
theories which is exactly parallel to the traditional problem identity- 
based theories have about how to avoid relativizing the individuation of 
concepts to the belief systems in which the concepts are embedded.“ 

Suppose, for example, your concept NEBRASKA is just like mine, 
except in the following respect: you think that Nebraska is about a 
thousand miles from Guadalajara; whereas I, never having heard of 
Guadalajara, have no views about the distance between Nebraska and 
it. So, the situation is that your semantic space, unlike mine, has a d+ 
mension proximity to Guadalajara along which the concept NEBRASKA 
has a location. Question: How much (if any) of this kind of thing can 
be tolerated, consonant with our concepts of NEBRASKA being simi 
lar? We emphasize that the vector notation as such offers no answer 
to this question. Transdimensional similarity (similarity among con- 
cepts in spaces which differ in dimensionality) is undefined; just ex- 


4 Maybe you do not mind if concepts are relativized to the behef systems in which 
they are embedded? So be it. But, of course, the semantic relativizahon problem 
lives just across the street from the semantic holism problem: If you have to relatmıze 
the identification of concepts to minds that have some of their behefs in common, 
how do you avoid having to relativize the identification of concepts to minds that 
have all of their beliefs in common? Our pount is that, if this worry is real, it persists 
unaltered when you change from a traditional semantics to a geometrical semantics. 
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actly as (lacking an analytic/synthetic distinction) an inferentialrole 
semantics offers no interpretation of concept identity for minds that do 
not share all of their beliefs. 

Likewise, some dimensions presumably count more than others 
do in similarity evaluations, and the relative weight of the dimen- 
sions differs for different concepts. So, at least in principle, it is pos- 
sible for minds to differ either in respect to their weightings or in 
respect of the interaction between their weightings and the con- 
cepts, or both. Although you and I agree that literacy is a salient di 
mension for evaluating philosophers and engineers, I think it is 
important for philosophers to be well read, but it does not matter 
much whether engineers are; whereas you think it goes the other 
way around. Is there a principled answer to the question how 
(dis) similar this makes our concepts of engineers and philosophers? 
If so, what is the relevant principle? 

There is, in short, a plethora of problems about how the similarity 
of concepts is to be evaluated across semantic spaces whose dimen- 
sionalities differ in various ways. These are just the old problems 
about the relativization of concepts to belief systems; except that, in 
the new notation, talk about the way that a concept is positioned 
with respect to the dimensions of a semantic space replaces talk of 
the way that a concept is embedded in a system of beliefs. 
Whichever notation you prefer, you have the same problems about 
how to identify concepts and the like in the sorts of circumstances 
that typically hold across msnds (or ‘languages’, or ‘theories’). 

We suspect that such problems are unsolvable unless the ana- 
lytic/synthetic problem can be solved. Nonetheless, as far as we can 
tell, this is the issue to which Churchland takes his new theory to be 
germane. We are going to argue that this is a massive ignorato elenchi 
on Churchland’s part. The proposal he has on offer is not of the right 
logical type to be a solution to the problem of transdimensional con- 
tent individuation; and, though there is a problem in the general 
vicinity that it ts of the right logical type to solve, it does not, in fact, 
solve that problem either. 

Churchland’s proposal co-opts work by Aarre Laakso and Gary 
Cottrell,” which, he says, “constitute[s] a decisive answer to Fodor 


* “Qualia and Cluster Analyms Assessing Representatonal Similanty between 
Neural Systems” (unpublished manuscnpt, 1998). 
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and Lepore's challenge.... [Laakso and Cottrell] successfully deploy 
one member of a large family of mathematical measures of concep- 
tual similarity, measures that see past differences—even extensive 
differences...,""° and now, given the nature of the challenge that we 
had offered (namely, that there is a relativization problem for vector- 
space semantics just like the relativization problem for identity- 
based semantics) one expects this sentence to end “of the semantic 
dimensions of the conceptual spaces.” For, to repeat, a vector-space 
semantics identifies concepts by reference to their positions in a se- 
mantic space; and the problem we raised—the one that Churchland 
apparently thinks Laakso and Cottrell have solved—was precisely 
‘How can minds that do not have the same semantic spaces have the 
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Figure 2. The activation space of the cells at the middle layer of NETtalk.” 


“ Paul Churchland and Patrica S. Churchland, On the Contrary. Critical Essays, 
1987-1997 (Cambridge: MIT, 1998), p. 81; and “Conceptual Similanty across Sen- 
sory and Neural Diversity,” p. 5. 

" From Churchland, The Engine of Reason, p. 88, his Figure 4.22. Reproduced 
with permission of the author. 
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same (or similar) concepts?’ Here, however, is the way the sentence 
that we just quoted actually does end: “measures that see past differ- 
ences...in the connectivity, the sensory inputs and the neural [our 
emphasis] dimensionality of the networks being compared” (1bid.). 

Now, something has gone wrong here. Neural spaces and seman- 
tics spaces are quite different sorts of things. As we have been seeing, 
positions along the dimensions of the semantic space relative to 
which a concept is situated correspond to propositional attitudes the 
holding of which is supposed to be constitutive of having the concept 
whereas the dimensions in a neural space correspond to neurons; and 
the positions along a given dimension specify the degree of excitation 
of the corresponding neuron (at a ime).” Unsurprisingly, though 
he is not reliable in observing it, this difference does show up in 
Churchland’s notation now and then; see Figure 2. Notice that the 
dimensions of thts cube are labeled with the names of neurons, not 
with the names of semantic dimensions (compare Figure 1 above). 

Now, just as there is a relativization problem about the identity (or 
similarity) of concepts and the like across semantic spaces of different 
dimensionality, so, too, there is a relativization problem about the 
identity (or similarity) of brain states across neural spaces of different 
dimensionality. For example, if you are a connectionist, you identify 
circuits with ntuples of nodes and links; and the nodes and links in a 
network are themselves identified by the totality of their relations to one 
another. On this picture, the notion “same (/similar) circuit” is de- 
fined only for networks of identical dimensionality, just like the no- 
tion “same (/similar) content.” 

We think that this analogy between individuation problems has 
caused Churchland to confuse the question of how to provide a 
transdimensional notion of brain-state identity with the question of 
how to provide a transdimensional notion of content identity: it is 
because he has failed to observe this distinction that he seems indif- 
ferent whether the dimensions of the spaces he talks about are se- 
mantic or neural. It is likewise because of this confusion, and 
because Churchland thinks (wrongly; see below) that Laakso and 
Cottrell have solved the transdimensional individuation problem for 
brain states that he believes (again wrongly) that Laakso and Cottrell 
have solved the transdimensional individuation problem for content 
states. So we shall now argue. 


" If we simplify (as, indeed, connectionist models often do) by assuming binary 
dimensions as opposed to ones with real number values, a state description relative 
to a neural space is an assignment of 1 or 0 to each neuron depending on whether 
it 18 active (at t) 
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Let us do this slowly. There is, to repeat, a problem about type in- 
dividuating brain states across neural spaces of different dimension- 
ality,” just as there is a problem about individuating contents across 
semantic spaces of different dimensionality. But it cannot simply be 
taken for granted that a proposed solution for the first of these prob- 
lems would solve (or even help to solve) the second. Suppose you 
have some scheme for picking out homologous brain regions across 
neural spaces of different dimensionality and that you are convinced 
that it preserves similarity (or identity) of neural types. That is, ho- 
mology is defined so that if there is a region of an n-dimensional 
space such that: 


(1) All the brain states in this region are neurologically similar or iden- 
tical; and if 

(2) for every space of m-dimensions (n C m) there is a homologous re- 
gion such that, if a brain state has a location in the n space it hasa 
location in the homologous region of the m space; then: 

(3) the brain states in the homologous regions of m-space are likewise 
neurologically similar or identical. 


That is, of course, a very strong condition on a notion of transdimen- 
sional homology for regions of neurological spaces. But our point is 
that, even if you had a notion of transdimensional homology which, 
in this sense, preserved neural state similarity (identity), you would 
have no right to assume that it likewise preserves semanitc similarity 
(or identity). If you want to claim that it does, you will need to pro- 
vide an argument. Roughly, you need to show why, if the brain 
states in a region of a certain neurological space are similar or iden- 
tical in content, then so, too, are the brain states in a region of any 


* For that matter, there 1s a problem about typing brain states across neural 
spaces with the same dimensionality if the dimensions of the spaces are supposed to 
correspond to neurons as anatomically indrividuated. There really is, we think, a 
dilemma hereabouts. If a neuron is a unit (for example, neurons are 
undividuated by their connectivity and the like), then there is a holism problem for 
them just as there is for blatantly functional entities like circuits. On the other 
hand, if neurons are anatomical units, there is no guarantee that brains that have 
the same neural populations are ever in the same brain states by any functional cri- 
terion. As far as we can tell (for example, from the labeling of Figure 2 (his Figure 
4.22)) Churchland prefers the first born of this dilemma; that 1s, he (implicitly) 
takes the individuation of neurons to be senuative to their functions. This raises, 
but does not solve, the question how exactly the dimension labels of his neural ac- 
tivaton spaces are to be interpreted; that is, whch functional properties ‘being 
neuron A and the like express. 

Begging the question whether, or to what extent, the ‘neurons’ in their networks 
are supposed to be anatomical as well as funchonal units is a standard practice in 
connectionist texts. It makes it seem to uninitiates that connectioniam is brain sci 
ence, not just computational psychology. That 1s very good for getnng grants, but 
it does cause a lot of confusion. 
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higher-dimension neurological space that is homologous under the 
preferred trandimensional mapping.” 

Notice that taking physicalism for granted (which, for the sake of 
the argument, we are happy to do) does not provide the argument 
that is required. “Token” physicalism says that, if individual brain 
states are identical, then so, too, are the corresponding individual in- 
tentional states; but token physicalism says nothing about whether 
there are such things as transdimensional counterparts of brain-state 
tokens; or whether, if there are, they ipso facto share any of their se- 
mantic properties; or, if they do, which ones. Likewise, “type” physi- 
calism guarantees that, if brain states are of the very same kind, then 
so, too, are the corresponding content states. But it makes no com- 
mitments about what, if any, content relations obtain for brain states 
that are merely transdimensionally homologous according to some or 
other typological scheme. Churchland must be assuming a typology 
of brain states that meets the condition that when neurological simi 
larity (or identity) is preserved under a transdimensional mapping, 
content identity is preserved, too. Nothing else would warrant his 
persistent practice of putting semantic labels on the dimensions of 
neural spaces and vice versa. But, to repeat, that semantic similarity 
is preserved transdimensionally when neural similarity is preserved 
transdimensionally does not follow from any kind of physicalism we 
have heard of, nor would it seem to be remotely plausible on inde- 
pendent grounds. 

Here, to summarize, is the dialectical situation as we understand it 
so far: Churchland thinks that Laakso and Cottrell have provided a 
notion of “kind of brain state” which preserves similarity of neural 
state types across spaces which differ in their neural dimensionality. 
And he (tacitly) infers that they have thereby provided a transdimen- 
sional notion of content-state similarity as well. But, to put it mildly, 


» It us worth noting that the claim that content-state types correspond to brain- 
state types under some or other criterion of individuation of the latter is implausi- 
bly strong quite aside from issues of transdtmenssonal individuation. In partcular, 
such a clum would require that wherever states of mind overlap in their inten- 
tonal contents, there 1s also an overlap of the corresponding brain states So, for 
example, there would have to be a (consttuent) brain state common not just to to- 
kens of the thought that ts red, but also to tokens of the thought I have decided to 
parni the cat red, and to tokens of the thought red ıs as far from green as bius is from yet- 
low, and so forth. There is, as far as anybody knows, not the slightest reason for 
supposing that the relanon between mental contents and brain states meets this 
constraint even if issues of transdimensionality are ignored. In fact, as things now 
stand, there are no known examples of cases where it does. It is a nice irony that ıt 
fals notoriously to be satisfied by the relation between circuits in connectionist 
networks and the contents that the circuits are supposed to encode. That is exactly 
why connectionists have so much trouble with compositionality. 
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this inference is tendentious. Churchland needs an argument for 
drawing it, but provides none. We doubt that he has one that he is 
shy about producing. Rather, our guess is that he has simply con- 
fused semantic spaces (of which the typical occupants are intentional 
states, so described) with neural spaces (of which the typical occu- 
pants are brain states, so described). 

All right so far. We now propose to show that, in fact, the sort of 
criterion Laakso and Cottrell suggest for type individuating brain 
states across dimensionality differences pretty clearly does not pre- 
serve either identity or similarity of the contents of the states." Bast 
cally, Churchland’s idea is that, even if my neural space has fewer 
dimensions than yours, we can embed the one in the other so long 
as the dimensions of my space are a subset of the dimensions of 
yours. In which case, we can assign each of my (actual or possible) 
neural states to the same state type as some (actual or possible) 
neural state of yours. A fortiori, we can type identify (at least some) 
brain states across (at least some) differences among neurological 
spaces. That, as far as we can tell, is the burden of the following re- 
mark: 


If a pair of dimensionally diverse spaces both contain...a same-shape m 
dimensional hypersolid..., then both spaces must each either constitute 
or contain an D subspace, an »D hyperplane of some onentation or 
other, that confines each hypersolid at issue. Since those respective hy 
perplanes...have the same dimensionality, there will be no problem in 
comparing the two solids.” 


We paraphrase (under correction, to be sure): 


Clarm Q: If my neural space is properly embedded in yours, then (ce- 
teris paribus) my (actual or possible) brain states are a subset 
of yours. 


Suppose, for the moment, that Q is true; hence that, if a neural 
space of lower dimensionality is embedded in a neural space of a 
higher dimensionality, then each brain state that has a location in 
the former space has a corresponding location in the latter. Our ar- 
gument will be that, even if Q is true, it does not provide for the transds 
mensional individuation of content states. That is because R—which is 


™ A fortiori, if the Laakso and Cottrell sort of view is right about how to mdrvid- 
uate brain-state types, then type physicalism is false about the relation between 
brain-state types and intentional-state types. (Type physicalists should not be 
alarmed by this, however, we shall see presently that the proposal on offer for 
transdimensionally indrviduating brain-state types does not work either.) 

2 Churchland, On the Contrary, p. 91; and “Conceptual Similarity across Sensory 
and Neural Diversity,” pp. 13-14. 
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the analog of Q for semantic spaces—is clearly false; content identity 
is not preserved under the embedding of one semantic space in an- 
other: 


Clam R: If my semantic space is embedded in yours, then (ceteris 
paribus) my (actual or possible) concepts are a subset of yours. 


Here is a candidate counterexample to R (the Nebraska/Guadalajara 
case discussed above is another). Suppose I have only three semantic 
dimensions hard-soft, black-white, heavylight. And suppose my con 
cept ROCK is identified with a vector that specifies a region in the 
space that these dimensions define. Likewise for you: your semantic 
space also has these dimensions (inter alia); and you too have a com 
cept that lies within the boundaries of the space that they define. The 
difference between us is that, whereas these dimensions define the 
whole of my semantic space, they define only an embedded subspace of 
yours. Now, does it follow that our concepts ROCK are similar? We 
should not have thought so. For perhaps your space has a dimension 
ANIMACY which, by assumption, mine lacks. And suppose that you 
think that rocks are actually quite animate; much more animate, say, 
than turtles; at least as animate, indeed, as half the members of the 
D.A.R. Is it stl the case that your ROCK concept is similar to mine? If 
so, suppose also that your space also contains a dimension for abstract 
ness, and that you think that rocks are pretty abstract; at least as ab- 
stract as the natural numbers, say, though maybe less abstract than the 
real numbers. Is your concept ROCK stl similar to miner If there are 
any such cases where the right answer is “no,” Churchland loses. 

Our point is that, even if concepts, semantic properties, intentional 
states, and the like can be identified with positions in a state-space, 
and even if position in that state-space is preserved when the latter is 
embedded in a space of higher dimension, it does not follow that 
those (or indeed any) semantic properties, intentional! states, and so 
on can be identified with positions in the higher-order space. That is 
because the conditions for the individuation of contents are nonmo- 
notonic. This, too, is something that everybody already knew, in- 
deed, it is just the analytic/synthetic problem in one of its familiar 
guises. Presumably Homer has a concept of water which was similar 
to ours as far as it went.” (Water is wet, it is potable, it freezes in the 
winter,...and so forth.) There remains the question whether Homer’s 
concept of water was similar to ours tout court, and the relevant con 


» Assuming, as Churchland of course does, that what concepts you have de- 
pends on what beliefs you have. 
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siderations are much the same ones that the corresponding question 
about the identity of our concepts would turn on: our WATER con- 
cept is embedded in a theory (a Weltanschaung, a form of life; blah, 
blah, blah) many of whose dimensions are quite different from any of 
the ones in Homer’s semantic space. What, if anything, follows about 
which concepts we share with him? 

Recapitulation: even if we did have a transdimensional notion of 
“braimstate similarity,” it would not follow that we would then have a 
transdimensional notion of “content similarity.” That is, even if re- 
gions of a lower dimensional space can be securely correlated with 
semantic interpretations, there is no reason why such a correlation 


should survive any proposed scheme of transdimensional mapping. 
I. CONCLUSIONS 


We close by remarking that, putting questions about content to one 
side, Churchland’s sort of story really does not work even for the 
transdimensional typing of brain states. Q is false because the iden- 
tity conditions for neuralkstate types, like the identity conditions for 
contentstate types, is nonmonotonic. A brain state that is identified 
with a position in a neural space of m dimensions need not be type 
identical with any region in an n-dimensional neural space in which 
the mspace is embedded. This is because brain-state types are func- 
tionally individuated; and functional individuation is itself nonmo- 
notonic.™ 

Suppose we have a neural space of m dimensions; and suppose 
that the function of a certain activation state of these neurons is to 
cause a flight response. Now, embed the mspace in a one of higher 
neural dimensionality (which is to say: put the same m-neurons ina 
bigger brain, preserving their connectivity to one another). It surely 
does not follow that there is any activation state of these (or, indeed, 
of any) neurons in the larger brain whose function is to cause flight 
responses. Indeed, it does not even follow that activation states de- 
fined over the original m-neurons have any function at all in the 
larger brain; that depends, inter alia, on what other neurons they are 
connected to. It is, for example, perfectly possible there is no physio- 
logically distinguishable system that all or only the original m-neu- 
rons belong to when they are in the bigger brain.” 


H We are not, please note, asserting—or even suggesting—that content individ- 
uation is nonmonotonic becouse it is a species of functional individuation. Heaven 


forfend. 

= Suppose, however, that it 1s denied that the individuation of kinds of brain 
states is functional. Even so, it is far from obvious that either identity or similarity 
of brain-state lands would be preserved under the sort of transdimensional map- 
ping Churchland imagines. It is patently not the case that brain states that are sim- 
ilar m a given world because they share certain nonfunctional properties are ipso 
facto similar in any world in which they have those properties in common. There 
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Brief conclusion: the question we were worried about in our ear- 
lier exchanges with Churchland was: ‘How can there be content simi- 
larihes between minds that have different beliefs?’ We were worried 
about this because Churchland keeps saying that similarity semantics 
avoids the main problems with which analytic/synthetic semantics 1s 
plagued, and our understanding is that ‘How can there be content 
identities between minds that have different beliefs?’ is the main 
problem with which analytic/synthetic semantics is plagued. 

But the question Churchland actually offers an answer to is: ‘How 
can creatures be in similar brain states if they have different num- 
bers of neurons?’ This is an interesting kind of question, to be sure; 
but it has nothing in particular to do with meaning. To the contrary, 
it arises for any theory that is committed to the functional individua- 
tion of things over which it quantifies. This is so whether or not the 
things over which it quantifies are psychological/intentional/seman- 
tic. In particular, the question of transdimensional functional state 
individuation arises for any theory that is committed to kinds of 
states that can have multiple realizations. Compare ‘How can tubes 
and transistors both be amplifiers?’, ‘How can jets and diesels both 
be engines?’, ‘How can a wood thing and a plastic thing both be 
pawns?’, and so on. 

It would be nice to have a transdimensional, similarity-preserving, 
typeidentity criterion for brain states, and it would also be nice to 
have a transdimensional, similarity-preserving typeidentity criterion 
for content states. And it would be very nice (and very surprising) if 
the two criteria turned out to be the same. But, as far as anybody 
knows or has any reason to believe, these two individuation problems 
raise quite different ontological issues, and their solutions—if, in- 
deed, they have got solutions—are likely to be largely independent 
All they have in common, as far as we can tell, is that both kinds of 
individuation are nonmonotonic, and that Churchland has not got a 
workable account of either. And nor, of course, has anybody else. 


JERRY FODOR 
ERNIE LEPORE 


Rutgers University 


may be ways of choosing canonical neural dimensions that meet this condition, but 
there is no a priori reason to believe that there are. To put this another way: it is 
one question whether brain states are individuated by their function, it is quite a 
different question whether, however they are individuated, they are reduacble to 
neural states, 
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ON A SUPPOSED COUNTEREXAMPLE TO MODUS PONENS* 


n “A Counterexample to Modus Ponens,” Vann McGee! contends 
that modus ponens is not strictly valid. As he understands the 
rule, it tells us that from an indicative conditional, together with 
its antecedent, one can infer its consequent; and it fails, he argues, 
when applied to a conditional whose consequent is itself a condi 
tional. McGee presents the following example in support of this 
claim: 
Opinion polls taken just before the 1980 election showed the Republi 
can Ronald Reagan decisively ahead of the Democrat Jimmy Carter, 
with the other Republican in the race, John Anderson, a distant third. 
Those apprised of the poll results believed, with good reason: 


(1) Ifa Republican wins the election, then if it’s not Reagan who wins 
it will be Anderson. 
(2) A Republican will win the election. 


Yet they did not have reason to believe 


(3) If it’s not Reagan who wins, it will be Anderson (ied, p. 462; num- 
bering added). 


He adduces two other similarly ingenious examples, having essen- 
tially the same structure as the one just cited. 

Alhough McGee’s examples are indeed arresting, I do not think 
that they establish his conclusion that modus ponens is invalid. 
What makes the examples plausible, I believe, is the assumption that 
the conditionals occurring in them satisfy the law of exportation, a 
principle which says that a conditional of the form 'If ¢ and y, then 
x’ entails one of the form 'If ¢, then if y, then y. McGee, in fact, 
claims that the law of exportation, as well as its converse, the law of 
importation, are valid for English indicative conditionals. He con- 
tends, in addition, that the law of exportation and modus ponens 
cannot both hold for indicative conditionals, and he is right about 
this, given the manner in which he construes indicative conditionals. 
He is mistaken, however, in defending exportation at the expense of 
modus ponens: it is doubtful that there is any conditional that sus- 
tains exportation (or importation) but not modus ponens. In what 
follows, after showing that McGee’s examples rely for their plausibil- 


* I am grateful to Jim Brown, Jack Canfield, Elmar Kremer, Dons Olin, and 
David Widerker for helpful comment and suggestions. 
1 This JOURNAL, LOH, 9 (September 1985): 462-71. 
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ity on the law of exportation, I argue that this law is simply not valid 
for conditionals of the sort McGee has in mind. I then show that, 
quite aside from the matter of exportation, McGee's contention that 
an inference like the one above counts against modus ponens hinges 
on untenable ideas about conditionals and their interpretation. Fi- 
nally, I offer an explanation of why the law of exportation, though 
an intuitively plausible principle of conditional reasoning, does not 
hold for the relevant class of conditionals. 
I 

A rule of inference is valid just in case it is truth preserving, that is, 
just in case it will never take us from premises that are true to a corr 
clusion that is not. Accordingly, the inference from (1) and (2) to 
(3) constitutes a counterexample to modus ponens only if (1) and 
(2) are true while (3) is not. McGee allows that, if each of (1) and 
(3) is taken as a material conditional, that is, a truth-functional condi- 
tional that is true if and only if it has either a faise antecedent or a 
true consequent, then the inference is unproblematic; for construed 
in that way, his example is an argument having both true premises 
and a true conclusion (thd, p. 464). He claims, however, that it is a 
mistake to suppose, in general, that an English indicative conditional 
has the truth conditions of a material conditional and, in particular, 
that (3) has them; and he adduces as evidence for the latter claim the 
observation that, if we were familiar with the relevant polls, we would 
not for a moment suppose that (3) is true, even though we might be 
resigned to the idea that the antecedent of (3) is false (ibid, p. 464). 

Doubtless, many will agree with McGee that the ordinary indica- 
tive conditional is not susceptible to a truth-functional analysis and 
that (3) is false despite having a false antecedent. Perhaps. It is easy 
to see that it would be misleading and, hence, inappropriate to as- 
sert (3) simply on the ground that its antecedent is false, but 
whether (3) would be false tn such circumstances is another matter." 
In my view, one can make a strong case for the idea that indicative 
conditionals should be taken truth functionally; but for present pur- 
poses, we may put this issue to one side. I find it doubtful that there 
is any consistent interpretation of the conditionals occurring in 


! In his 1967 William James Lectures, H. P. Grice attempts to reconcile our use 
of the indicative conditional with a truth-functional analysis, using principles of 
conversation; the lectures have been published in his Studies tn the Way of Words 
(Cambridge: Harvard, 1989). 
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McGee's example, truth functional or otherwise, which will make the 
premises true and the conclusion false.’ 

We see that the inference from (1) and (2) to (3) constitutes a 
counterexample to modus ponens only if (3) is a false conditional, 
in which case it cannot be construed as a material conditional. Let 
us suppose, accordingly, that the conditionals occurring in this in- 
ference are not truth functional and, moreover, that (3) is false. 
Clearly, (2) is true. But what about (1)? It is clear that (1) is true 
if it is taken as a material conditional. But what reason is there for 
supposing that (1) is true if it is taken as a strong conditional, as I 
shali call it, that is, as an indicative conditional taken non-truth- 
functionally? McGee observes that someone familiar with the rele- 
vant polls is likely to believe (1), and do so with good reason. Of 
course, sometimes we repudiate a premise when we realize that it 
leads to an utterly incredible conclusion. According to McGee, 
this is not the case here; he contends that even in the presence of 
(3), one is not inclined to relinquish (1) (op. cit., p. 463). This is 
certainly correct, but it does not, by itself, show that the condi- 
tional we are prepared to accept, when we accept (1), is a strong 
conditional, rather than a material conditional. And, of course, if 
(1) is taken truth functionally and (3) not, then the inference from 
(1) and (2) to (3), though having true premises and a false conclu- 
sion, is not an instance of modus ponens: since the sentence ‘If it’s 
not Reagan who wins, it will be Anderson’ does not have the same 
meaning in the premise in which it occurs as it does in the conclu- 
sion, it would be an equivocation to infer (3) from (1) and (2).* 

If, as McGee contends, the law of exportation is a valid principle 
for strong conditionals, we have a good reason for thinking that (1) 
is true taken as a strong conditional. In the circumstances under 
consideration, we are interested in the question of how the election 


* Walter Sinnott-Armstrong, James Moor, and Robert Fogelin note that while 
McGee says there 1s no reason to believe that (3) 1s true, he never explicitly argues 
that it is false—“A Defense of Modus Ponens,” this JOURNAL, Dod, 5 (May 1986): 
296-300. They point out that for McGee’s example to be a counterexample to 
modus ponens, (3) must be false, adding that (3) “can be true even if there is no 
reason to believe it” (p. 297). (They go on to suggest that McGee's target is not 
modus ponens at all, but rather an epistemic or probabilisuc analogue of that 
rule.) It is doubtful, however, that anyone would think that (3) is tue unless she 
were to give it a tuth-functional reading, and, as I indicated earher, McGee does 
explicitly argue that it is implausible to construe the indicative conditional as a mæ 
tenal conditional. However this may be, I think that modus ponens is defensible 
on any coherent reading of the mdicatrve conditional. 

1 E. ] Lowe makes this last point in “Not a Counterexample to Modus Ponens,” 
Anahsis, XLVO, 1 (January 1987): 44-47. Although Lowe is certainly correct that 
McGee's example does not impugn modus ponens if (1) is taken as a material con 
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will turn out (as opposed to, say, how the primaries might have 
turned out). There are exactly two Republican candidates, Reagan 
and Anderson. Thus, if we assume that a Republican other than 
Reagan will win the election, we shall naturally conclude that Ander- 
son will win, and rightly so. In other words, it is entirely reasonable, 
and perfectly correct, to believe that: 


(4) If a Republican wins the election and it is not Reagan who wins, 
then it will be Anderson. 


This last conditional is certainly true when construed as a material 
conditional, but it is also true construed as a strong conditional. 
Given the law of exportation, however, we may infer (1) from (4). 
Therefore, if the law of exportation holds for strong conditionals, 
(1) is tue when taken as a strong conditional. 

It looks as though two intuitively plausible principles of classical 
logic, modus ponens and the law of exportation, are in conflict, and 
a potential casualty of the clash is the strong conditional. We are as- 
suming that the conditionals that figure in McGee’s example are 
strong conditionals and, in addition, that (2), the second premise, is 
true, while (3), the conclusion, is false. We have just seen that (4), 
taken as a strong conditional, is true. If the law of exportation is a 
valid principle for strong conditionals, then the inference from (4) 
to (1) shows that (1) is true; but if (1) is true, McGee’s example has 
true premises and a false conclusion, in which case modus ponens is 
not strictly valid. On the other hand, if modus ponens is beyond re- 
proach, then the inference from (1) and (2) to (3) shows that (1) is 
false; but if (1) is false, the inference from (4) to (1) has a true 
premise and a false conclusion, in which case strong conditionals do 
not satisfy the law of exportation.” We must conclude that at least 
one of these principles fails for the strong conditional. Since both 
principles are valid for the material conditional, and since both are 
intuitively plausible, someone might read these results as an argu- 
ment for abandoning the strong conditional altogether: perhaps 


ditional, I do not see that he provides a convincing reason for thinking that it must 
be taken as a material conditional Lowe contends that it u more plausible to sup- 
pose that the consequent of (1) is a matenal conditional than it 1s to suppose that 
the inference in question invalidates a deeply rooted prinaple of deductive infer- 
ence such as modus ponens. 

* Doris Olin uses an example similar in structure to McGee’s to draw the conclu- 
sion that exportation is not valid for subjunctive conditonals—“Newcomb’s Prob- 
lem. Further Invesngations,” American Philosophical Quarterly, xm, 2 (April 19°76): 
129-38. She argues that while the conditional ‘If 7 will live 20 years after t and I 
were to jump off the Empire State Building at t (with no protective devices), then J 
would become the object of intense medical interest’ is true, the conditional ‘Tf I 
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English indicative conditionals simply have the truth conditions of a 
material conditional after all. Although I would not find this conclu- 
sion disagreeable, the problem is that while it would solve our imme- 
diate difficulty concerning the likes of (1), (3), and (4), a similar 
one would arise in connection with subjunctive conditionals, and it 
is hard to imagine that we would resign ourselves to the idea that, for 
example, the counterfactual (subjunctive) conditional ‘Had Reagan 
not won, Anderson would have’ is true. 

Evidently, if we are to preserve a strong conditional, we must relin- 
quish one of these principles of classical logic. McGee recognizes 
that modus ponens and the law of exportation will not both come 
out valid on a nonclassical account of the conditional; he thinks, 
however, that we have good reasons for preferring exportation to 
modus ponens. He says, on the one hand, that we have “explicit ex- 
amples to show that the indicative conditional does not satisfy modus 
ponens” (op. at, p. 466) and, on the other, that we have good induc- 
tive evidence that the law of exportation holds for indicative condi 
tionals: 

In general, we assert, accept, or believe a conditional of the form tf $, 

then if y, then 0! whenever we are willing to assert, accept, or believe 

the conditional 'If @ and y, then 6! It appears, from looking at exam- 
ples, that the law of exportation ...is a feature of English usage (op. at, 

pp. 46465). 


He adds that “[i]t would appear that the law of importation, the con- 
verse of the law of exportation, is also valid” (op. cit, p. 465, footnote 
5). McGee goes on to claim that any theory of conditionals which 
denies the law of exportation, such as Robert Stalnaker’s,° cannot be 
entirely correct (op. cit., p. 466). 

Obviously, if (1) and (2) are true and (3) is false, we have an ex- 
ample showing that conditionals of the relevant sort do not satisfy 
modus ponens. It is a mistake, however, to imagine that McGee’s ar- 
gument against modus ponens stands on its own, independently of 


will live 20 years after 4 then if I were to jump off the Empire State Building at t 
(with no protective devices), 7 would become the object of mtense medical inter 
est’ is not; she says that “even though ‘J will live 20 years after i’ is true, we would 
not want to grant ‘If Z were to jump off the Empire State Building at ¢ (with no pro- 
tective devices), [would become the object of intense medical interest’” (p. 131). 

* Stalnaker sets out his account of conditionals in “A Theory of Conditionals,” in 
Studses m Logical Theory, American Philosophical Monograph Serves, Number 2 (Malden, 
MA: Blackwell, 1968), pp. 98-112; reprinted in W. L. Harper, Stalnaker, and G. 
Pearce, eds., Ifs (Boston: Reidel, 1981), pp. 41-55. On Stalnaker’s account, a con 
ditional 'If œ, then w! is true in possible world i just in case w um true in the possible 
world most similar to 1 in which @ is true. Thus, a conditional of the form IIF ¢, 
then if y, then 7! is true in i just in case its consequent 'If y, then x! is true in the 
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exportation. We have little reason to accept (1) as a true strong con- 
ditional, unless we are prepared to accept the law of exportation, 
and if we have little reason to accept (1) as a true strong conditional, 
then we have little reason to accept the inference from (1) and (2) 
to (3) as a counterexample to modus ponens. The idea that this in- 
ference discredits modus ponens and the idea that exportation is le- 
gitimate for strong conditionals stand together, at least in this sense: 
unless we accept exportation, we have little reason to accept the idea 
that the inference counts against modus ponens. In short, the exam- 
ple under discussion depends on the background assumption of ex- 
portation for its plausibility. 
0 

McGee may be right that we are apt to accept a conditional of the 
form 'If @, then if y, then 6! when we accept one of the form 'If ¢ 
and y, then @. We are certainly inclined to accept (1), and I think 
that we do so because we accept (4). This evidence, however, does 
not warrant the conclusion that the law of exportation is valid for 
strong conditionals; nor is that conclusion true. 

One reason for doubting that the linguistic evidence cited by 
McGee shows that the law of exportation holds for strong condition- 
als is based on the converse of that law, the law of importation. Cer- 
tainly, if we reflect on evidence of the sort that McGee sets out, as 
well as on our pre-analytic intuitions, we shall find as much reason to 
accept importation as exportation: intuitively, it is just as plausible to 
infer a conditional of the form ‘If @ and y, then 6! from one of the 
form 'If œ, then if y, then 61, as the other way around. We can show, 
however, that law of exportation and the law of importation cannot 
both apply to strong conditionals; for in the presence of several stan- 
dard assumptions about conditionals, the law of importation is in- 
consistent with the view that (1) and (2) are both true and (3) is 
false. One assumption is that an indicative conditional of the form 
If @, then y is logically true just in case @ logically implies y. Be- 
cause every sentence logically implies itself, it follows that any sen- 
tence of the form 


@-world (as we might call it) most similar to 4 which is to say, on this account, just 
in case x is true in the wworld most similar to ¢-world most similar to i Conse- 
quently, the truth of a conditional of the form 'If @ and y, then 7! does not ensure 
the truth of one of the form 'If ¢, then if w, then yi: the former will be true in i, 
and the latter false in , if x 1 true in the '@ and y!-world most similar to 1 but false 
in the world most mmular to the @world most similar to 1, which is possible if 
there are @-worlds (in which w 1s not true) and wworlds (in which ¢ 1s not true) 
more similar to : than the closest 'ġ and w-world. 
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(5) Lf Gif @ then y), then (if @ then w) 


is logically true. But assuming the law of importation, a sentence of 
the form of (5) entails one of the form 


(6) If (if ¢ then y) and @, then y 


Since any sentence having the form of (5) is logically true and since 
(5) logically implies (6), it follows that any sentence having the form 
of (6) must be logically true as well. It should be clear, however, 
that, if the antecedent of (6) logically implies the consequent of (6), 
then modus ponens is a valid rule of inference. But as we saw ear- 
lier, if modus ponens is valid for strong conditionals, exportation is 
not. Consider, for example, the result of replacing @ in (6) with (2) 
and y with (3): 


(7) If (if a Republican wins the election, then if it is not Reagan who 
wins it will be Anderson) and a Republican wins the election, then 
if it is not Reagan who wins it will be Anderson. 


Since (7) is logically true and since a conditional is logically true 
only if its antecedent logically implies its consequent, it follows that 
it is not possible for the antecedent of (7)—that is, the conjunction 
of (1) and (2)—to be true and the consequent of (7)—namely, 
(3)—to be false. Therefore, given that (2) is true and (3) is false, we 
have to abandon the idea that (1) is true. But since (4) is true, this 
conclusion contradicts the view that the law of exportation holds for 
strong conditionals. Therefore, if the law of importation holds for 
strong conditionals, then the law of exportation does not. 

This piece of reasoning shows that one cannot consistently hold 
(as McGee does) both that importation is valid and that the infer- 
ence from (1) and (2) to (3) is not. It also shows that, if importation 
is valid for strong conditionals, then exportation is not. We have 
not, however, demonstrated that importation is valid for strong con- 
ditionals; so this objection, while showing that McGee’s account re- 
quires modification, is not necessarily fatal to exportation. But the 
conclusion that exportation and importation do not both hold for 
strong conditionals casts doubt on McGee’s grounds for thinking 
that the law of exportation does: inasmuch as the linguistic evidence 
favoring importation is essentially of the same character and quality 
as that favoring exportation, it is hard to see how we can discount 
the former and not the latter. 

There is, in fact, a compelling reason for thinking that the law of 
exportation does not hold for strong conditionals, which is that, if it 
does, the strong conditional is truth functional. Again, since a con- 
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ditional is logically true if and only if its antecedent logically implies 
its consequent, it follows that any conditional of the form: 


(8) If (either @ or y) and not-¢d, then y 


is logically true. But in the presence of the law of exportation, (8) 
logically implies: 
(9) Ifeither ¢ or y, then if not-¢, then y 


Hence, any conditional having the form of (9) is logically true. Sub- 
stituting ‘It is Reagan who will win’ for @ in (9) and ‘It will be Ander- 
son who wins’ for y, we obtain: 


(10) If either Reagan wins or Anderson wins, then if Reagan does not 
win, then Anderson will. 


Since (10) is logically true, it follows that it is not possible for the an- 
tecedent of (10) to be true and the consequent false. But since a dis- 
junction is true if at least one of its disjuncts is, the antecedent of 
(10) is true; hence, the consequent must be true as well. The conse- 
quent of (10), however, is, simply (3), which we have been assuming 
is false. Since (8) is unexceptionable, it seems that we must either 
surrender the law of exportation, which sanctioned the step from 
(8) to (9), or drop the critical assumption that (3) is false. In fact, 
the pattern of inference from (8) to (9) reveals a more general diff 
culty, namely, that in the presence of exportation, any conditional of 
the form 'If not- then y is true if either ¢ is true or y is, which is to 
say that a conditional is true if it has a false antecedent or a true con- 
sequent. All that is required for the conclusion that 'If not-¢ then y! 
is fully truth functional is the observation, whose truth is uncontro- 
versial, that the converse of (9) is logically valid as well, that is, that a 
sentence of the form ‘If not- then w' logically implies one of the 
form 'Either @ or yl. We may infer, as a result, that the law of expor- 
tation holds only for those conditionals having the truth conditions 
of the material conditional; therefore, it does not hold for strong 
conditionals if, as we have been assuming, they are to be distin- 
guished from material conditionals. 

Two points emerge. The first, which may be surprising given the 
conflict between modus ponens and exportation noted earlier, is 
that, if the law of exportation (or importation) holds for some class 
of conditionals, then so does modus ponens. And the second, which 
may explain why it looks as though both will not come out valid for 
any nonclassical conditional, is that the law of exportation simply 
does not hold for strong conditionals. 
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Now that we see that the law of exportation does not hold for strong 
conditionals, we may return to the question of whether the inference 
from (1) and (2) to (3) counts against modus ponens. We agreed that 
(2) is true and alowed that (3) is false, and we asked whether there is 
any reason for supposing that (1) is true if it is taken as a strong condi- 
tonal. McGee's observation that anyone familiar with the relevant 
polls would certainly accept (1) prompted the further question of 
whether there is any reason for thinking that the conditional we ac- 
cept, when we accept (1), is a strong conditional Without the law of 
exportation, I cannot see that there is much reason for thinking that it 
is. Of course, it does not follow from this that (1), taken as a strong 
conditional, is false. There is, however, a decisive reason for thinking 
that it is, at least given our other assumptions, and, hence, for thinking 
that the inference from (1) and (2) to (3) does not constitute a prob- 
lem for modus ponens. McGee contends that it is a notoriously diffi- 
cult matter to specify an account of the truth conditions of the English 
indicative conditional Perhaps so; but it is easy to see that (1) cannot 
be true. For it is an essential feature of any conditional, indicative or 
otherwise, that it is false if it has a true antecedent and false conse- 
quent. The antecedent of a conditional specifies conditions that are 
supposed to be sufficient, in some sense, for those specified by the con- 
sequent, accordingly, no sentence can count as a true conditional of 
any sort if the former conditions obtain but the latter do not. The an- 
tecedent of (1), however, is simply (2), which is in fact true, and the 
consequent of (1) is (3), which we have agreed is false; hence, (1) isa 
false conditional.’ Therefore, the inference from (1) and (2) to (3) is 
not a counterexample to modus ponens. 

It follows that McGee’s example does not threaten modus ponens 
even if we take the constituent conditionals as strong conditionals. 
Although I agree that in the relevant circumstances, we are apt to ac- 
cept (1) and (2) and reject (3), I do not think that we are prepared 
to accept a conditional known to have a true antecedent and false 


’ Other considerations point to the same conclusion. The sentence (1') ‘Tf “A 
Republican will win the election” is true, then “If it is not Reagan who wins it will be 
Anderson” us true’, which us logically equivalent to (1), says that, if (2) is true (which 
in fact it is), then (3) is true (which m fact it is not). How can the assumption that 
some sentence is true, when it actually is true, have any bearing on the truth value of a 
sentence that is false? Ifa sentence is m fact false, then it is false given any and every 
truth; so if (3) is false, as we have been assuming, then it is false given that (2) is true. 
But the claim that (3) 1s false if (2) is true n inconsistent with the claim that ($) is true 
if (2) is true, which is what (1’) says, hence, (1’) is false, in which case so u (1). 
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consequent. According to an influential suggestion made by F. P. 
Ramsey,’ one should evaluate a conditional by reflecting on certain 
conditional beliefs: add the antecedent of the conditional as an as- 
sumption to your current stock of beliefs, making whatever minimal 
revisions are necessary for consistency, and then evaluate the conse- 
quent in light of this adjusted set of beliefs. The leading idea, which 
has considerable plausibility, is that one should accept, or reject, a 
conditional just in case one is prepared to accept, or reject, the con- 
sequent under this hypothetical set of beliefs. Commentators have 
pointed out the shortcoming that it is difficult to specify a plausible 
set of constraints that will, in general, determine a unique way of ad- 
justing one’s initial beliefs to accommodate the assumption of the 
antecedent (especially when the antecedent is inconsistent with 
those beliefs). However this may be, it is not difficult to see that 
when we already accept the antecedent of a conditional, our ad- 
justed stock of beliefs will be exactly the same as our initial stock of 
beliefs. Consequently, the question of whether one should accept, 
or reject, some conditional, when one already accepts the an- 
tecedent, is exactly the same as the question of whether one should 
accept, or reject, the consequent of that conditional. We see, ac- 
cordingty, that (1) and (3) must have the same truth value, given the 
truth of (2), and that it would be irrational for someone who recog- 
nizes the truth of (2) to accept the former and not the latter. This 
shows, I think, that the conditional that we are prepared to accept, 
when we accept (1), does not have (3) as a false consequent and, 


thus, is not a strong conditional. 
IV 


McGee’s example, it turns out, points to a defect in the law of ex- 
portation rather than modus ponens. As we saw earlier, the law of 
exportation holds only for material conditionals; as we see now, the 
inference from (4) to (1) is a clear counterexample to the law. Ac- 
cordingly, we have ample grounds for concluding that the law of 
exportation is not valid for strong conditionals. But even if we are 
agreed that the law of exportation does not hold for strong condi- 
tionals, we may be puzzled that it breaks down in this way. In fact, 
one may wonder why the argument presented earlier in support of 
(4) does not apply to (1) as well: in the relevant circumstances, 
there are exactly two Republican candidates, Reagan and Ander- 
son. We begin with the assumption that a Republican will win the 


* “General Propositions and Causality,” in his Foundation of Mathematics and Other 
Logical Essays (New York: Humanities, 1950), p. 248. Stalnaker’s account in “A The- 
ory of Conditionals” is presented in part as a lopment of Ramsey's proposal. 
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election; we then add the assumption that Reagan will not win. It 
seems reasonable to infer, in the presence of these assumptions, 
that Anderson will win the election. Accordingly, we concluded 
that (4) is true. Why does this reasoning not speak in favor of (1) 
as well as (4)? 

Using considerations introduced earlier, we can explain why the 
reasoning does not apply to (1) and, hence, why exportation is in- 
valid. If we examine (4) and (1) employing a model for evaluating 
conditionals like the one suggested by Ramsey, we shall see that 
while (4) and (1) introduce the same assumptions, they do not in- 
troduce them in the same way. To evaluate (4), we must adjust our 
initial stock of beliefs—which includes the beliefs that there are two 
Republican candidates, Reagan and Anderson, that Reagan will win, 
that a Republican will win, and that Anderson will come in a distant 
third—-to make room for the assumption that a Republican other 
than Reagan will win. Accordingly, we must discard the belief that 
Reagan will win and make other modifications as well, but one be- 
lief we are not at liberty to drop is the belief that a Republican will 
win: obviously, we are unable to add the assumption that a Republi- 
can other than Reagan will win unless we retain our belief that a Re- 
publican will win. Thus, we must evaluate the consequent of (4) in 
light of the belief that a Republican will win, together with the 
other beliefs in our adjusted set, in particular, that the only other 
Republican candidate is Anderson. In order to evaluate (1), on the 
other hand, we must first look at the consequent of (1), that is, (3), 
in light of our initial stock of beliefs adjusted to include the an- 
tecedent of (1), that is, (2); of course, since we already accept (2), 
our adjusted stock of beliefs will be exactly the same as our initial 
stock of beliefs, which is why (1) and (3) have the same truth value. 

But now someone may wonder why (3) turns out false, for it may ap- 
pear that when we add the antecedent of (3) to our stock of beliefs, the 
resulting hypothetical set of beliefs will be the same as the one we had 
when we added the antecedent of (4) to our initial stock of beliefs. It 
may appear, in other words, that we evaluate the consequent of (3) and 
the consequent of (4) under the same hypothetical set of beliefs. But 
then, since the consequent of (3) and the consequent of (4) are one 
and the same, it would not be possible for (4) to be true and (3) false.’ 


* More generally, it may that when we evaluate the consequent of a condr 
tional of the form 'If @ and y, then y! and the consequent of the consequent of 
‘If œ, then if y, then gi, we do so under the same hypothetical set of beliefs, 
namely, our initial stock of beliefs adjusted to include both @ and y, in which case 
the truth of the former would ensure thar of the latter and the law of exportation 
would be sustained. 
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It is this line of thought that gives the law of exportation its illu 
sion of cogency; it is, however, based on a mistake. To evaluate (3), 
we must adjust our stock of beliefs—which, again, includes the be- 
liefs that there are two Republican candidates, Reagan and Ander- 
son, that Reagan will win, that a Republican will win, and that 
Anderson will come in a distant third—to make room for the as- 
sumption that Reagan will not win. We can accommodate this as 
sumption in one of two ways, by abandoning the belief that a 
Republican will win or by abandoning the belief that Anderson will 
come in a distant third. The former alternative was not available in 
the case of (4), for the antecedent of (4) includes the assumption 
that a Republican is going to win. But (3) introduces no such as- 
sumption; the antecedent of (3) says only that Reagan will not win. 
So it is open to us, in the case of (3), to incorporate the assumption 
that Reagan will not win by dropping the belief that a Republican 
will win. In fact, since the likely reason that we believe that a Repub- 
lican is going to win (in the circumstances under consideration) is 
that we believe that Reagan is going to win, it is natural to think that 
the smaller revision in our set of beliefs would involve modifying the 
belief that a Republican will win, that is, (2), rather than modifying 
the belief that Anderson will be a distant third in the election, which 
is why we are apt to think that (3) is false. But then, so is (1). 

In summary, the assumption that the law of exportation is valid for 
the conditionals occurring in the inference from (1) and (2) to (3) 
may lead us to suppose that the first premise is true and that the in 
ference counts against modus ponens. The law of exportation, how- 
ever, is valid only for truth-functional conditionals. Taken as a 
truth-functional conditional, the first premise is true, but then so is 
the conclusion; taken as a nomtruth-functional conditional, the com 
clusion is false, but then so is the first premise. We see, accordingly, 
that the inference from (1) and (2) to (3) does not count against 
modus ponens on any consistent interpretation of the conditionals 
occurring in it. 

BERNARD D. KATZ 
University of Toronto 
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COMMENTS AND CRITICISM 
SEMANTICS SAN DIEGO STYLE* 


ver since David Rummelhart and James McClelland’ began 
Ess pioneering research into parallel distributed processing 

(PDP) over a decade ago, the University of California/San 
Diego has been a center for research into PDP models of cognition 
and the application of this research to traditional issues in such 
fields as anthropology, linguistics, philosophy, and psychology. As 
part of the interdisciplinary cognitive science program at UCSD, 
Paul M. Churchland? was quick to recognize and bruit the philo- 
sophic import of PDP, or connectionist, models of cognition, apply 
ing it initially to issues in the philosophy of mind and the philosophy 
of science. Most recently, Churchland has explored the possibility of 
using connectionist resources to defend a conceptual-role theory of 
semantics. In a recent series of exchanges, Churchland and the 
team of Jerry Fodor and Ernest Lepore’ have been engaged in a de- 
bate over whether statespace semantics (SSS) can fortify conceptualrole 
semantics by providing a robust account of similarity of meaning. I 
agree with Churchland that SSS constitutes a “serious alternative™ to 


the language of thought (LOT); however, by construing SSS as a ver- 


* I am grateful to Paul Churchland, Mark Collier, Adrian Cussins, Aarre Laakso, 
and Eric Thomson for helpful discussions on these issues and for comments on 
earlier drafts of this paper. I am also thankful for helpful comments from aud} 
ences at the University of California/San Diego and Rutgers University, where ear- 
Her drafts of this paper were read. 

! Paralled tm the Microstructure of Cognition (Cam- 
ate MIT, 1986), which is considered the locus dasncus of connectionism. 
Perspectios (Cambridge: MIT, 1989). 

r Fodor and re, “Paul Churchland: State Space Semantics,” in their Hotsw- 
A Shopper's Guide (Cambridge: Blackwell, 1992), pp. 187-207; Churchland, “Fodor 
and Lepore: State-space Semantics and Meaning Holism,” Philosophy and Phenome- 

Research, Lo (September 1993): 667-72; Fodor and Lepore, “Reply to 
Churchland,” Philosophy and Phenomenological Research, Lin (September 1998): 679- 
82; Churchland, “Second Reply to Fodor and Lepore,” in R. McCauley, ed., The 
Churchlands and Their Cniscs (Cambridge: Blackwell, 1996), pp. 278-838; Church- 
land, “Conceptual Similan Ta Sensory and Neural Diversity: The Fodor/Lep- 
ore Challenge Answered,” this JOURNAL, Ke, 1 (January 1998). 5-82. 

t After showing that language of thought commits one to the view that we have 
not learned every language that we know, Fodor writes: “I admit that these conclu- 
sions really may seem scandalous. I should be inclined to new them asa reducto 
ad absurdum of the theory that learning a language us learning the semantic prop- 
erties of its predicates, except that no serious alternative to that theory has ever 


been proposed”— The Language of Thought (Cambridge: Harvard, 1975), p. 82 
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sion of conceptualrole semantics, I think that Churchland has actu- 
ally done a disservice to the theory. 

I make a distinction here between what I call the vehicle question 
and the content question. One of the central claims of this discussion 
is that SSS, properly construed, is a thesis about the vehicle of com 
tent, and thus is neutral with respect to any particular answer to the 
content question (for example, conceptual-role, causal, teleologi- 
cal).* Even if this is right, the question as to whether SSS can make 
conceptual-role semantics any more plausible is interesting and 
worth pursuing. The primary criticism that Fodor and Lepore levy 
against SSS as a version of conceptual-role semantics is that, like all 
other conceptualrole theories, it cannot provide a robust account of 
similarity of meaning. In his most recent paper, Churchland’ argues 
that there exists a definitive response to this criticism. I argue here 
that the results to which Churchland appeals do not, in fact, provide 
an answer to Fodor and Lepore’s challenge. I make a distinction be- 
tween three different “stages” of SSS, and argue that Churchland 
takes a step in the right direction by making it clear that he is not de- 
fending the naive stage-one SSS attributed to him by Fodor and Le- 
pore. Even the more sophisticated stage-two SSS which Churchland 
defends, however, does not come to grips with the full force of the 
similarity burden posed by Fodor and Lepore. I finally argue that a 
kind of conceptualrole SSS can meet the similarity burden if one 
moves to stage-three SSS, though in this version the extent to which 
conceptual role determines content is somewhat mitigated from that 
of a full-fiedged conceptual-role theory of semantics. 

I A THESIS ABOUT THE VEHICLE OF CONTENT 
SSS is a thesis about the vehicle of semantic content. What I mean 
by a ‘vehicle’ of content is simply what carries the content. Another 
way to think of the vehicle of content for a system is that its proper- 
ties are those properties which affect the computational functioning 
of that system.’ I shall further explicate this conception of SSS here 
by contrasting it with two competing answers to the vehicle question: 
the linguistic tradition and the mentakstic tradition. While SSS is a com- 


* Because of this, it has been suggested to me that I should call the tion 
“state-space syntax.” I am sticking with the label ‘state-space semantics’ both for 
histoncal reasons and because, even though the particular view of reference one 
accepts in the end may differ from one state-space theory to the next, they are all 
semantic theones within a state-space framework. 

* “Conceptual Similarity.” 

” See Adrian Cuns, “The Connectionist Construction of Concepts,” in Mar- 
garet Boden, ed., The Philosophy of Artificial Inteligence (New York: Oxford, 1990), 
368-440, here p. 369 
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petitor of LOT, which belongs to the latter tradition, I shall attempt 
to show that the two theories are actually closer to one another than 
either Fodor and Lepore or Churchland seem willing to acknowl 
edge. 

The view that stands in sharpest contrast to SSS is what I am call- 
ing the linguistic tradition, a view that holds the following two 
claims: 


(1) Linguistic entities carry content independently of how they connect 
with any particular mental representation. 

(2) The computational properties of our “syntactic/semantic engine” 
are those exhibited by language (for example, the inferential rules 
of propositional logic). 


For those who hold (1), linguistic entities are the primary unit of 
meaning, and the task in learning a language is to get one’s mental 
representations to correspond to the right words. On this picture, 
words must hook onto mental representations which, in turn, must 
hook onto the world. I take it that it is because the primary refer- 
ence relation on this view is from words to mental representations 
(rather than to the world) that Hilary Pumam in his Dewey Lectures* 
argues that the problem of reference is simply a replay of the seven- 
teenth-century empiricist problem of perception. The second of 
these claims follows naturally from the first. If the mind has content 
in virtue of corresponding to certain linguistic entities, then the 
rules that govern the computational functioning of linguistic enti- 
ties, namely, propositional logic, must also be the rules that govern 
the computational functioning of the mind. 

An alternative tradition, which can be called the mentalistic tradi 
tion, turns the picture of claim (1) on its head. That is, rather than 
conceive of language learning as a matter of getting one’s mental 
representations to correspond to words, one could see language 
learning as a matter of getting words to correspond to one’s mental 
representations.” The most systematically developed contemporary 
account of this alternative is probably LOT, which holds that the ve- 
hicle of content is a mental representation, in particular symbols in a 
“language of thought.” These symbols are the semantic simples of 


* “Sense, Nonsense, and the Senses: An Inquiry into the Powers of the Human 
Mind,” this JOURNAL, XCI, 9 (September 1994): 445-517. 

* On this view, the primary reference relation is from mental representations to 
the world. Since the primary reference relation already involves the world, this 
may be a way around the problem that Putnam presents in his Dewey Lectures 
(Had). 

” See Fodor, The Language of Thought. 
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the language of thought, like propositions are the semantic simples 
of propositional logic. While LOT is a member of the mentalistic 
tradition, in holding (2) I take its defenders still to have a foot in the 
linguistic tradition, for they still view thought as proceeding in the 
same way language does; that is, they take the computational proper- 
ties of mental representations to be the same as the computational 
properties of propositions. 

SSS follows LOT in rejecting the first claim of the linguistic tradi- 
tion. In this respect, SSS and LOT can both be seen as members of 
the same tradition in the philosophy of language. Both SSS and LOT 
start with a representational theory of mind, that there must be “men- 
tal particulars possessed of semantic and syntadic properties” ` and that 
“mental processes are transformations of mental representations.”™ 
In addition to a commitment to realism about mental representation, 
SSS shares with LOT the conviction that learning a natural language 
requires having a system of mental representations in place. That is, 
SSS agrees that “certain kinds of very central patterns of psychological 
explanation presuppose the availability, to the behaving organism, of 
some sort of representational system,” and especially that “to extend 
our psychology to infrahuman species is thereby to commit ourselves 
to cognitive processes mediated by representational systems other 
than natural languages." Because LOT assumes that mental repre- 
sentations are symbols in a language of thought, it insists that the com- 
putational properties of mental representations follow that of natural 
language. This is where SSS parts company with LOT. Because SSS 
has a different account of mental representation, it has a different ac- 
count of the computational properties of those representations. 

SSS is based on a connectionist theory of mind. The primary unit 
of representation in connectionist models of cognition is a pattern of 


" Fodor, “Fodor's Guide to Mental Representation,” in his A Theory of Content 
and Other Exsays (Cambridge: MIT, 1990), pp 3-27, here p. 23. 

* In his “Guide to Mental Representation,” Fodor denies that Churchland holds 
a representational theory of mind because Churchland 1s an eliminatvist about the 
mental states postulated by folk psychology. It would only follow from Church- 
land's eliminativism that he rejects the representational theory of mind if the men 
tal states of folk psychology were the only kind of mental representations poenble. 
But this is precisely what any connectionist theory of mind denies. Connectionism 
maintains that mental representatons are best thought of as multidimensional 
state-spaces. (This does not mean that connectionism is iucompatible with folk 
psychology, only that one can eliminate the entues of folk psychology and still be 
left with mental representatons.) 

= The of Thought, p 31 

" Churchland takes a similar line for the case of children when discussing what 
he labels “the problem of early stages”"-—Scenafic Reahsm and the Plastictty of Mind 
(New York: Cambndge, 1979), pp 127-37. 
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actvation across a population of neurons, such as V1 of the primary 
visual cortex, or a certain layer in an artificial network. This pattern 
of activation can be thought of as an mdimensional activation vector, 
where n is the number of neurons in the population, as it can be 
uniquely characterized by an ntuple of numbers with each element 
corresponding to the activation level of a given neuron. The pattern 
of activation can also be represented as a point in an n-dimensional 
state-space, with each axis of the space corresponding to each neu- 
ron in the population. One can also combine the last two and repre- 
sent the pattern as a vector in state-space. All of these are merely 
different ways of representing the same thing. According to SSS, the 
vehicle of content = a pattern of activation = an activation vector = 
a point in state-space = a vector in state-space.” Vectors in state- 
spaces have computational properties different from those of sym- 
bols in a language of thought. In fact, the kinematics and dynamics 
of thought processing on the state-space model looks nothing like 
that of language. According to SSS, thought proceeds by vector- 
to-vector transformations rather than by logical inferences between 
sets of sentences. 

I have claimed that SSS properly construed is a thesis about the ve- 
hicle of content. In claiming that the vehicle of content is the state- 
space, I take SSS to be rejecting both claims of the linguistic 
tradition and instead holding the following two claims: 


(1) Mental representations (in particular state-spaces) are the vehicles 
of content, and linguistic entities have content only insofar as they 
“hook onto” these mental representations. 

(2’) The computational properties of our “syntactic/semantic engine” 
are those of connectionist models of cognition (for example, 
vector-to-vector transformation and prototype activation). 


In claiming that SSS is a thesis about the vehicle of content, I am 
also claiming that SSS is neutral with respect to the “content ques- 
tion,” the question as to how a particular vehicle obtains its content. 
One could simply apply her favorite semantic theory to SSS. For ex- 
ample, a causal role SSS would state that a given activation vector v 
in S means (refers to) Xif and only if: (a) ‘X causes v’ is a law, (b) at 
least some Xs cause v (in the actual world), and (c) for all Y#xX 
which cause v, Y would not have caused v were either (a) or (b) 
false; a functional/teleological SSS would state that a given activation 
vector vin S means (has the content of) X if and only if it is the 


» While in his “Conceptual Similarity” Churchland still sees SSS as a version of 
holism, he does emphasis what the vehicle of content is for SSS, and provides even 
more detail, including helpful diagrams, than that given here. 
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function of v to indicate Xs to the system in which Sis instantiated; a 
conceptualrole SSS would state that the content of a given activation 
vector, v, in a state-space, S, is determined by the role it plays in the 
larger conceptual economy of the agent in which S is instantiated; 
and so on for any semantic theory. In making this last move, I have 
conceded to Fodor and Lepore that in order to ascribe content to a 
particular vector, one must already presuppose some semant theory. 
But this is not an argument against SSS as a thesis about the vehicle 
of content. In fairness to Fodor and Lepore, they were not arguing 
against SSS so construed and thus did not intend the objection to be 
directed at this conception of SSS. It may be the case that I have said 
nothing with which a defender of Fodorian style semantics would dis- 
agree (see footnote 5). In claiming that one can apply any semantic 
theory to SSS, I have implicitly asserted that Churchland’s 
conceptual-role version of SSS can work; however, that is precisely 
what the primary debate has been between Churchland and Fodor 
and Lepore, and it is to this debate that I now turn in the remainder 


of this discussion. 
Il CONCEPTUAL-ROLE STATE-SPACE SEMANTICS 


The primary criticism that Fodor and Lepore levy against SSS as a 
version of conceptuakrole semantics is that it falls victim to the same 
criticism that afflicts all conceptualrole theories, namely, that it can- 
not provide a robust account of similarity of meaning. In his most 
recent paper, Churchland argues that recent results in cognitive sci- 
ence “constitute a decisive answer to Fodor and Lepore’s 
challenge.”"* I shall argue here that the results to which Churchland 
appeals do not, in fact, get conceptual+tole SSS off Fodor and Lep- 
ore’s hook. 

Before moving to Churchland’s defense, it will be instructive to re- 
visit the original challenge facing conceptuakrole SSS. The problem 
was that similarity of meaning between two state-spaces is understood 
as similarity of activation across dimensions. This requires a way of 
distinguishing when two spaces are two tokens of the same state- 
space type, which in turn requires that the content of each dimen- 
sion in one space be identical to the content of its corresponding 
dimension in the other. In order for this condition to be met, there 
must be a property expressed by each dimension in the respective 
spaces. But if one is to attribute a property to the dimensions, then 
one must already have a notion of meaning. Furthermore, in order 
to tell whether the dimensions of each space express the same prop- 
erty, one must already have a notion of identity of meaning. Not 


* “Conceptual Similanty,” p. 5. 
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only does SSS not solve the problem of similarity of meaning, it fails 
even to avoid the problem, and so cannot make conceptual-role se- 
mantics any more plausible than it was to begin with.” 

This criticism was originally aimed at what I shall call stage-one SSS, 
which individuates the content of activation vectors by absolute posi- 
tion in state-space (that is, position in relation to the axes). It thus 
attempts to provide a robust account of similarity of meaning by tak- 
ing similarity of meaning as equivalent to similarity of location in 
state-space. There are passages from Churchland” which suggest 
that this is the conception of SSS which he holds: 


..a visual sensation of any specific color is literally identical with a spe- 
cific triplet of spiking frequencies in some triune brain system...then 
the similarity of two color sensations emerges as just the proximity of 
their respective state-space postions (ibid., p. 104; my italics). 


As the above passage shows, this account was originally intended as 
an account of experience, but, as Fodor and Lepore recognize, the 
state-space model can provide an account of semantic content as 
well as of the content of phenomenal experience. That is, not only 
can one consider the expenence of red to be a pattern of activation 
(that is, a point in state-space), but one may also consider the seman- 
fc content of the term ‘red’ to be a pattern of activation. 

The above criticism is sound against stage-one SSS. Neither I nor 
Churchland, however, wish to defend stage-one SSS. In his more re- 
cent papers on the topic, Churchland clarifies that he is not defending 
stage-one SSS, but rather (call it) stagetwo SSS. While stage-one SSS as- 
cribes content to vectors in virtue of the content of the dimensions 
and defines similarity across spaces in terms of similar absolute posi 
tion in state-space, stage-two SSS holds neither of these views. In the 
previous section, I showed how conceptual-role SSS (as well as any 
other version of SSS) can ascribe content to activation vectors without 
ascribing content to the individual dimensions of the space. With re- 
gard to similarity, stage-two SSS defines similarity across spaces in 
terms of relative position, that is, the position of a vector in relation to 
the other vectors. Stage-two SSS then individuates the content of acti 
vation vectors by similar relative positions, that is, by similarity of 
clustering. The basic idea behind the similarity of clustering condi- 
tion is that a given activation vector vin space $ is similar in content 
to an activation vector win space S, just in case v stands in the same set 


" Fodor and Lepore, “State-Space Semantics,” pp. 197-99. 
w “Some Reductive Strategies in Cognitive Neuroblology,” in his A Neurocomputa- 
tional Perspectrve, pp. 77-110. 
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of relations to the other vectors in S, as does w to the other vectors in 
S,. For stage-two SSS, vectors of different spaces can be representa- 
tionally similar to one another regardless both of the content of the 
dimensions of each space and of the number of dimensions of each 
space.” 

With stage-two SSS formulated, we can now address the question 
of whether Fodor and Lepore’s criticism against conceptualrole SSS 
still has force. The burden for the proponent of conceptualrole SSS 
is to provide an account of how to determine whether the activation 
vectors of one state-space stand in the same set of relations to one 
another as do the activation vectors of another space. If the propo- 
nent of conceptualrole SSS can meet this burden, then indeed SSS 
has made the world safe for conceptual-role semantics by providing a 
robust account of similarity of meaning. 

Churchland argues that the experimental results of the work by 
Aarre Laakso and Garrison Cottrell” “constitute a decisive answer to 
Fodor and Lepore’s challenge.” Laakso and Cottrell have developed 
a measure similar to Guttman’s point alienation, which purports to 
measure the degree of similarity between two representational struc- 
tures (for example, two state-spaces). I shall not here go into detail 
about the experiment, as a fairly detailed account was presented by 
Churchland. The basic idea behind the measure is that one can 
look at three points at a time, 7, p, and q, hold one of the points, say, 
r, as the anchor, and compute whether p or qis closer to r. One then 
does the same thing for the corresponding points r^, ph, q° of another 
space and compares the ranking. The procedure is then repeated 
for all possible triplets. If the two spaces have exactly the same set of 
rankings, the measure will have a value of 1, and if they have exactly 
the opposite set of rankings, the measure will have a value of -1. 

Laakso and Cottrell used this measure to compare networks that 
were trained up to identify an input pattern as being one of five cok 
ors: red, yellow, green, blue, or purple. They trained up twenty dif 
ferent networks, five networks for each of four different ways of 
representing the input pattern.” They then used the measure to 


" In his “Conceptual Similarity,” pp 8-13, Churchland develops this idea in 
more detail and provides some very helpful diagrams, including one of a “family 
space” consisting of the Hatfelds, McCoys, and Andersons. 

* “Quala and Cluster Analyns. Assessing Representational Similarity between 
Neural Systems” (under review), presented in earlier draft to the 1996 meeting of 
the Society for Philosophy and Psychology, San Francisco. 

= The four ways of representing the inputs were: (1) an 8-bit binary number, 
(2) a decimal number between 0 and 1, (3) a gausman curve over five elements, 
and (4) sequentially as a 3-digit decimal. So each network had a different number 
of cells (that is, dimensions) and the cells of one network represent completely dif- 
ferent properties than those of another. 
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compute the rank-order correlation both between networks that 
were trained up on the same input-encoding and between networks 
that were trained up on different input-encodings. They found that 
the rank-order correlation between networks trained up on the same 
input-encoding was almost always greater than .95, meaning that the 
activation vectors of the different networks were very similarly clus- 
tered between states. They also found that the rank-order correla- 
tion between networks trained up on different input-encodings was 
also very high, usually greater than .9 (with a minimum value of .85). 
These results show that even when networks have completely differ- 
ent ways of representing an input pattern at the “sensory” level, they 
can still have highly similar ways of representing that input at a 
higher level (that is, of “conceptualizing” the input). For this experi- 
ment, the middle-layer activation vectors of the state-space of one 
network do stand in approximately the same set of relations to each 
other as do the activation vectors in the state-space of another net- 
work. 

It is easy to see why Churchland took these results to be a conclu- 
sive answer to Fodor and Lepore, as it seems clear that it shows that 
there exists an empirical method for determining whether the activa- 
tion vectors of two spaces are similarly clustered, and thus that the 
burden for conceptualrole SSS as laid out above is met. The ques- 
tion, however, is whether it does so in a way that does not already 
presuppose a theory of content, for the original charge against 
conceptuaLrole SSS was that it could only provide a robust account 
of similarity of meaning by presupposing a theory of meaning. 

The SSS skeptic will probably have already noticed that the Laakso- 
Cottrell measure involved comparing a triad of vectors in one space 
with a “corresponding” triad of vectors in a different space. That is, 
the measure presupposed that there was no problem with labeling 
one vector rin space $ and another “corresponding” vector r'in Sy. 
In the actual experiment, this in fact was not a problem, for which 
vectors corresponded to which was simply determined by the input 
pattern that gave rise to the vectors. If two networks were both shown 
input pattern red,, then the activation vector produced at the hidden 
layer in one network is labeled n, and the activation vector produced 
at the hidden layer of the other network is labeled 1,4, and similarly 
for, say, orange, and blue. The two triads can then be put through 
the measure to see if the two networks conceptualized red,, orange, 
and blue, similarly (and likewise for all such triads of inputs). 

This may be fine as an account of the similarity of qualia cluster- 
ing; however, if the issue is how one goes about assigning content to 
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vectors in the first place, then the Laakso-Cottrell measure does not 

seem to be of help to conceptuatrole SSS. Consider again the initial 

charge against SSS: 
We're claiming, in effect, that Churchland has confused himself by tak- 
ing the labels on the semantic dimensions for granted. The label on a 
dimension says how positions along the dimension ars to be mntenpreted, for ex- 
ample, it says that they're to be interpreted as expressing degrees of F- 
ness. To label a dimension as the F-ness dimension is thus to invite the 
question “in virtue of what do the values of this dimension express de- 
grees of F-ness rather than, say, degrees of G-ness?” | 


If the proposal is to give an account of content similarity by appeal- 
ing to position in state-space, then the proponent of SSS bas given a 
circular account by helping himself to semantic labels for the dimen- 
sions, which means he has already presupposed a theory of meaning. 

The initial response from Churchland was to deny that it is neces- 
sary to label the dimensions with semantc content in order to as- 
cribe content to a vector: 


What makes each vector, Patricia’s and mine, a “kitten” vector is not the 
identity of our respective patterns of neuronal activity across the hidden 
layer (these are likely quite different), nor the semantic identity of the 
constituting dimensions in her hidden-unit population and in mine 
(their diffuse “contents” may well be quite idiosyncratic). What gives 
this vector the content “kitten” is the overall role that this vector plays 
in the larger cognitive and motor economy of which it is an interiock- 
ing part” 
This, though, only partially answers Fodor’s attack. It does show that 
it is not necessary to ascribe content to the individual dimensions of 
a space in order to ascribe content to a particular vector in that 
space (this is the move from stage-one to stage-two SSS). But it does 
help itself to a pre-existing theory of meaning, namely, conceptual 
role semantics. I have already noted that this is not an argument 
against SSS conceived as a thesis about the vehicle of content; how 
ever, the issue here is whether SSS makes conceptual-role semantics 
any more plausible (than, say, a sententially based conceptual-role 
approach), and at stage-two SSS, the issue of content similarity sim- 
ply arises anew. The question is how one can determine whether 
Paul’s ‘kitten’ vector plays a similar role “in the larger cognitive and 
motor economy of which it is an interlocking part” as does Patricia’s 
‘kitten’ vector. 


™ Fodor and Le pore, “State-Space Semantics,” p. 199. 
™ Churchland, “State-Space Semantcs and Meaning Holism,” p. 670 
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I argued above that under stage-two SSS this challenge can be met 
if one can provide a criterion for determining whether the vectors in 
one space stand in the same set of relations to one another as do 
those of another vector. The next response by Churchland was that 
the Laakso-Cottrell results meet this challenge. If this only works by 
first labeling the vectors of each space, however, then Churchland 
has cheated even worse than was initially charged; for he has helped 
himself not simply to labels for the dimensions, but to labels for the 
actual vectors themselves. If there was no problem about when one 
could say that a vector of one space corresponds to a vector of an- 
other space, then why is the Laakso-Cottrell measure needed in the 
first place? To solve the problem in the way that was done for the ac- 
tual experiment and say that the corresponding vectors were identi- 
fied by corresponding input is just to help oneself to an externalist 
theory of content. But to appeal to some externalist theory to fix the 
content of vectors would obviate the conceptuatrole part of the con- 
tent story. 

This does not mean that conceptualrole SSS is a doomed theory. 
It just shows that the Laakso-Cottrell results does not get conceptual- 
role SSS out of the similarity of meaning problem. There may be 
as-of-yet-untapped resources that meet the similarity of clustering 
burden without presupposing some externalist theory of content. 
This also does not show that conceptualrole semantics is bankrupt. 
All this means is that SSS has not yet made conceptualrole semantics 
any more plausible than it was to begin with. This will be cause for lit- 
tle concern for those who think that conceptual-+ole semantics is al- 
ready the most plausible semantic theory in the offing. I also do not 
mean to suggest that the Laakso-Cottrell results are of no use to SSS. 
In the next section, I shall develop stage-three SSS and show the im- 


portance of these results for that conception. 
NI. STAGE-THREE STATE-SPACE SEMANTICS 


While the foes of conceptual-role semantics are now smiling, the 
friends of conceptualrole semantics may be skeptical of my abrupt 
dismissal of the Laakso-Cottrell results. After all, I suggested in the 
last section that, if one already had labels for the vectors of each 
space, then the results of applying the Laakso-Cottrell measure were, 
at best, redundant and, at worst, completely irrelevant. But, one may 
respond on behalf of conceptual-role SSS, the results could have 
been negative! That is, once the LaaksoCottrell measure was ap- 
plied to the conceptual structure at the middle layer of the differing 
networks, a low or negative score could have been the result. In fact, 
when the measure was applied to the conceptual structure at the in 
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put layer, the results were negative—the networks did not conceptu- 
alize the colors similarly at all at the input level. So the positive re- 
sults at the middle layer must mean something. I shall propose here 
that the way to resolve the tension between the conclusion of the last 
section and the intuition that the positive results of the Laakso-Cottrell 
experiment mean something is to draw a distinction between two 
types of representational content, which I shall call correlational and 
embedded content. 

The correlational content of a representation is roughly that with 
which it is correlated. If a given representation is correlated with 
cows, then the correlational content of that representation is ‘cow’. 
This notion is very close to what the Churchlands called “calibra- 
tional content.** What I mean by correlational content is also very 
close to the ordinary notion of reference. The main difference is 
that (depending on one’s theory of reference) the two notions may 
not be co-extensional. It is probably the case that the correlational 
content of a given representation is always that to which it refers, 
though there could be cases in which a representation is nonrefer- 
ring but still has correlational content. One such example might be 
fictional characters. The name ‘Pegasus’ may be nonreferring, but it 
could still have correlational content, namely, a certain creature 
from Greek mythology. Divergence between reference and correla 
tional content may also arise when one’s ontology does not corre- 
spond to the world; the term ‘demonic possession’, for example, 
may be nonreferring and yet have correlational content, perhaps a 
particular type of neurosis. 

The content that I want to designate by the label ‘embedded con- 
tent’ is that which a representation has in virtue of being embedded 
in a certain cognitive structure of a system (or psychological reality 
of an agent).” That is, the embedded content of a given representa- 
tion is the significance that its correlational content has for the cog- 
nitive economy of the system of which it is a part, given the cognitive 
structure of that system. This will include both the relations that the 
representation bears to other representations in the system and to its 
downstream causal connections such as its effect on thought (mental 
processing) and behavior (motor control). My notion of embedded 
content is very similar to the Churchlands’ notion of translational 


~ Paul Churchland and Patriaa Churchland, “Functionalism, Intentionality, 
and Qualia,” in Paul Churchland, A Meurocomputational Perspective, p. 

” I specifically want to avoid calling embedded content ed Se because I 
want to allow for embedded content to be nonconceptual, in the sense specified by 
Gareth Evans, The Vaneties of Reference (New York: Oxford, 1982); or Cussins. 
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content, which “is a matter of the inferential/computational rela- 
tions it bears to all the rest of [the system’s] representations.”™ One 
main difference between my embedded and their “translational” 
content is that they do not seem to include the behavioral effects of 
a representation as part of the translational content (though per- 
haps they would not have a problem with doing so). Even if there is 
no conceptual difference between the two notions, one problem I 
have with the label ‘translational’ is that it sounds too conceptual 
and suggests a close connection to natural language, whereas, the 
content that [ want to designate with the label ‘embedded’ need not 
be conceptual, nor need it only apply to the content of terms in a 
natural language. 

Given this distinction, there is a straightforward way of resolving 
the tension outlined above. The correlational content of a given ac- 
tivation vector is best specified on some externalist approach. This is 
how one can label the vectors of a given space. I have tried in speci- 
fying what I mean by correlational content not to prejudice the issue 
in favor of externalism, but instead to leave room for an internalist 
(for example, conceptualrole) approach to correlational content. 
Such an approach, however, will have to deal with the similarity ar- 
gument with which we began, and one will not be able to appeal to 
the Laakso-Cottrell results to meet this challenge with respect to cor- 
relational content. An internalist approach to correlational content 
will also have to deal with the many arguments against internalist ap- 
proaches to reference. Embedded content, on the other hand, does 
fit most naturally with some internalist, conceptual-role approach. 
The LaaksoCottrell measure then provides a way of determining 
whether (at least one aspect of) the embedded content of a vector 
with a particular correlational content in one space is similar to that 
of a vector with the same correlational content in another space. 
The positive results of the Laakso-Cottrell experiment show that the 
embedded content of the activation vectors whose correlational con- 
tent is a certain color was indeed similar across the state-spaces at the 
middle ayer level of differing networks. In solving the similarity of 
content problem for embedded content, one can make use of the 
Laakso-Cottrell measure without circularity.” 


= “Functionalism, Intentionality, and Qualia,” p. 43. 

7 The distinction between correlational and embedded content may also pro- 
vide a framework for addressing the problems surrounding the connection be- 
tween certain kinds of beliefs (for example, indexical or moral) and action. For 
example, while it is true that the correlational content of indexical representations 
is necessarily specified token reflexively, the substantive question is whether the 
embedded content of indexical representations necessarily contains a motivational 
force (see my “What Is Essential about Indexicals*” forthcoming in Philasophical 
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IV CONCLUSIONS 

All versions of SSS have in common the claim that the occurrent ve- 
hicle of content is an activation vector across a population of neu- 
rons. Stage-two SSS differs from stage-one SSS in how it individuates 
the content of those vectors, but both versions assume a unitary no- 
tion of content. That is, both versions implicitly assume that activa 
tion vectors are the vehicles for a single type of content Stage-thres 
SSS differs from the first two versions in holding that the content of 
vectors must be individuated along (at least) two dimensions. 
Churchland correctly identifies the vehicle of content for SSS, and as 
such is an improvement over LOT. He does a disservice to the the- 
ory, however, by presenting it specifically as a version of conceptual 
role semantics. In meeting Fodor and Lepore’s similarity of 
meaning challenge for conceptual-role SSS, Churchland again 
makes a move in the right direction by distinguishing stage-two from 
stage-one SSS; however, the experimental results to which Church- 
land appeals do not, in fact, get conceptuakrole SSS off Fodor and 
Lepore’s hook. By moving to stage-three SSS, one can allow concep- 
tual-role to determine one type of content and use the Laakso- 
Cottrell results to provide a robust account of the similarity of that 
type of content. In the end, though, regardless of the extent to 
which conceptual-role provides an answer to the content question, 
SSS, properly construed as a thesis about the vehicle of content, is 
plausible and Churchland has done a service by bringing its exis- 
tence to light. 

EVAN TIFFANY 
University of California/San Diego 


Studwes). Likewise, the debate about moral motvation can be construed as whether 
the embedded content of moral representatons necessanly contains a motiva 
tional force. It may be the case that much of the dispute between internalsts and 
externalists boils down to a conflation of the two types of content. That is, perhaps 
internalists of an expressivist or prescriptivist ilk may have been trying to provide 
an account of the embedded content of moral terms, while externalist naturalists 
may have been trying to provide an account of correlational content. The former, 
ın asserting that the embedded content contains a motvational force, and the lat- 
ter, ın denying that the correlational content does, are not in fact disagreeing 
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BOOK REVIEWS 


Having Thought: Essays in the Metaphysics of Mind. JOHN HAUGELAND. 
Cambridge: Harvard University Press, 1998. 390 p. Cloth $29.95. 


On the back of the dust jacket of this fine book, one can barely make 
out two representations of a customized penny for our thoughts, 
drawn by John Haugeland. Accompanying Honest Abe on the heads 
side appear the words ‘Existential Commitment’, ‘Thought’, and 
‘Self’; while tails shows the Lincoln Memorial and E pluribus unum, 
surrounded by two unlikely additions: ‘Constituted Domain’ and 
‘Objects’. Haugeland explains: “The basic Kantian/Heideggerian 
conclusion can be summed up this way: the constituted objective 
world and the free constituting subject are intelligible only as two 
sides of one coin.” Not everything with eyes and ears and a brain is a 
“free constituting subject”; apes and dogs and dolphins do not have 
ontologies because they do not have thoughts; they do not have 
thoughts because they do not have the “censoriousness” in their cul 
tures or social structures which provides the leverage for understand- 
ing, distinguishing a true thought from a false thought, and without 
that, thoughts cannot really have content. There are, to be sure, im- 
portant differences in the Umwelt or manifest image of different 
species, but cats have no more metaphysics than clams or chrysan- 
themums do. Objects are constituted by people only, and our ulti 
mately moral sense of norms, of “existential” commitment, far from 
being a sort of ethical add-on to the factual world of objects and 
properties, is the very ground on which our capacity to know, and re- 
flect upon, objects depends. 

This is a collection of essays, all but two previously published, so 
the question must be posed: Are the benefits of the synergy worth 
the cost? Here is one point in favor: I had read almost all these es- 
says several times, and discussed some of them with Haugeland for 
dozens of hours, and even published commentaries on several of 
them, but I did not get the big picture until I read the book. That is, 
I had seen what he was driving at, but was unpersuaded until I saw 
the pieces put together anew. I now think he is largely right. Even 
better: the points on which I harbor residual doubts are, in general, 
more empirical than philosophical, and will mostly sort themselves 
out as we learn more about how minds are actually constructed. 
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Haugeland claims that the difference in kind, not degree, between 
the “cognitive” capacities of people and animals is too obvious to re- 
quire argument. This might be true in the philosophical commu- 
nity, but it is at any rate no longer true in the world of animal 
cognition research, where the backlash against the behaviorism of 
yore makes fighting words of any such pronouncement. Still, 
Haugeland is right that the gulf between animal mind and human 
mind is huge. It is not that surprising to see a philosopher claim 
that even the brightest of beasts do not have a manifest image much 
like ours, but the paths he takes from this starting point are strik- 
ingly novel. In particular, he firmly bucks the fashion and declares 
that language is not the key to this huge chasm. 


I certainly do not deny that language is characteristic of people, and 
centrally important to what we are. But I want to resist what strikes me 
as an all too prevalent tendency to make of language a magic fulcrum 
on which every human quality turns—as if language were the whole 
essence of what distinguishes us from machines on the one hand, and 
beasts on the other (256). 


A strange centerpiece of Haugeland’s case against the role of 
language in making—not just marking—the chasm is an extended 
thought experiment about imaginary “supermonkeys” who have no 
language but nevertheless play chess! They play real chess, embed- 
ded in the matrix of norms that constitute proper play, not the 
pseudo-chess of Deep Blue, whose Umwelt is as bereft of the appre- 
ciations of norms as a spreadsheet’s Umwelt is of an appreciation of 
the human value of a dollar. There is nothing constitutively lin- 
guistic about chess, certainly, but the contortions of Haugeland’s 
tale of how the supermonkeys come to their norm-guided involve- 
ment in the game are unpersuasive to me. Let us compare these 
curious beings to others we might imagine: superorangutans, also 
languageless, who somehow manage to build and navigate aircraft 
carriers. An aircraft carrier with no written signs, no instruction 
manuals, no gossiping crew or order-issuing captain may be logically 
possible in some philosophical sense, but so what? Haugeland is 
right to resist the popular, lazy idea of language as a magic ful- 
crum; but what about language as a garden-variety fulcrum, causally 
required in this world for chess, for building aircraft carriers—and 
for “normativity” itself? If human language and human norms 
emerged in a co-evolving package, with innovations in each laying 
the ground for elaborations in the other, back and forth—and that 
is how evolution typically works—then both ‘Language made norms 
possible’ and ‘Norms made language possible’ are misleading over- 
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simplifications. Haugeland’s resistance to the former is, however, 
salutary. 

Haugeland’s work on the conceptual foundations of cognitive sci- 
ence has been deservedly influential, and several of the classics are 
reprinted here: his auspicious debut with a target article in Behavioral 
and Brain Sciences, “The Nature and Plausibility of Cognitivism” 
(1978), and “The Intentionality All-Stars” (1990). Put these to- 
gether with “Analog and Analog” (1981) and “Mind Embodied and 
Embedded” (1995), and you have a lucid and balanced setting for 
what I consider the most valuable essay in the collection, “Represen- 
tational Genera” (1989). He describes this as “perhaps the most di- 
sheveled piece in the volume” (5), but while one can see why he was 
tempted to issue this self-disparagement, the fact is that this is a stun- 
ning piece, hugely ambitious, and surprisingly successful in meeting 
its ambitions. It purports to be a revisionist taxonomy of representa- 
tions, and that it surely is, but its new taxonomy amounts to a strik- 
ingly new way of thinking about representations. Few topics have 
been mulled over more than representations in the last half century, 
and one would think that by now all the major moves had been 
made, and that it was time for fine tuning. Indeed, as Haugeland 
says, all this mulling has congealed into canonical accounts—which he 
nicely defines as “what almost everybody expects almost everybody 
else to believe.” 

According to the canon, the two main genera are logical/linguis- 
tic and iconic/pictorial. Today, we are faced with the interesting 
question of whether the much-touted distributed representations of con- 
nectionism (or holograms, for instance) deserve to be recognized as 
a distinct genus, as different from pictures and sentences as pictures 
and sentences are from each other. Although the quirky problems 
in characterizing the “essences” of these primary genera have long 
been acknowledged, the canonical wisdom has been that this is not 
to be done in terms of the content of what they represent, but some- 
how in their manner or format or system of representing. Hauge- 
land sets out to show that the received wisdom is not just bedeviled 
by unsolved problems (everybody knows that), but that these prob- 
lems are unlikely to be solved since this underlying assumption is a 
huge mistake. This certainly must strike the mavens of the field, as it 
first did me, as wildly implausible. Ah, all the more reason to think 
that maybe Haugeland has caught a big fish. 

He develops his case with a very well-considered game plan. Im 
deed, this essay is a bravura display of philosophical move making of 
the highest order. If I were to put together a “how to” book on 
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“how to do philosophy,” this essay would be one I would dissect at 
length, revealing its virtues. Most philosophical books and articles 
have a rather plodding plan of attack, along the lines of: many be- 
lieve that p; there are n familiar arguments against p, and there are n 
+8 familiar rebuttals to those arguments; here is a new argument 
against p. Haugeland sees his task more subtly in a three-stage devel- 
opment. It begins with some loosening-up counterexamples which 
he does not bother refining in the standard way, since they are just 
attention directors; this he follows by some “outlandish counter the- 
ses,” which are themselves not to be taken seriously but which hint at 
the opening to come in the third movement: a way of seeing how 
our canonical assumption might be undercut. 


I am arguing instead that all three canonical accounts misfire for a sin- 
gle underlying reason—namely, they all mislocated the generic essences 
in the representing relation—and hence, the appropriate response is 
neither repair nor despair, but fundamental reconception (177). 


The fundamental reconception involves taking on a hitherto unre- 
marked distinction between recording, which can be a “witless” 
process, and representing, which, in general, cannot. This is promis- 
ing, somewhat to our surprise, precisely because Haugeland is not 
planning to use it to make some ‘No machine can ever do x claim—a 
good example of the tactical discretion which abounds in this easay. 
His distinction between “witless” and “knowing” processes enables 
him to describe a new type of theoretical entity: “‘virtual logical rep- 
resentations’ (the ‘facts’)” (183). We all know, do we not, that any 
doctrine of atomic and molecular facts is doomed if it treats such 
facts as somehow independent of language expressing them? But 
once again, he is not planning to push his doctrine of facts into that 
hopeless cause; he has imagined another, more defensible use for 
them. 

The task facing any “Heideggerian/Kantian” theorist is to do jus- 
tice to the role of us in constituting the denizens of “our” world with- 
out lapsing into awful relativism/subjectivism on one side or caving 
in to noumena or a “God’s eye view,” on the other. Haugeland’s so- 
lution, which grows on me, is to show how and why it is Aard to “con- 
stitute” a world (that takes care of anything-goes relativism) but not 
because there is a privileged way that the world—the real world—has 
always been constituted. His view is a close kin, I think, of my view of 
the evolution of colors: before color vision evolved on this planet, 
sunsets and cliffs and volcanic eruptions had the reflective proper- 
ties they did, but i makes no sense to ask if those sunsets were, say, 
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red-since that question has no meaning independently of a refer- 
ence class of normal observers. We can, of course, extrapolate back 
from our current vantage point and fix and answer such questions, 
using ourselves as the touchstone for colors, but we must recognize 
that we are doing that. Were there dinosaurs before Homo sapiens 
came along and invented censoriousness and then ontology so that 
dinosaurs could be constituted? Of course there were, but do not 
make the mistake of thinking that this acknowledges a fact that is in- 
dependent of Homo sajnens. 

Haugeland is not a neurophilosopher or psychophilosopher or 
cognitive scientist; he is a pure (or hard core—chacun à son goût) 
philosopher. Yet he brings to his task—the unflinching considera- 
tion of grand metaphysical themes from Immanuel Kant to Richard 
Rorty—an engineer’s deep commitment to clarity and making sure 
things work. He once turned an old IBM selectric typewriter into a 
computer printer (think about that interface problem!) and is the 
author (in assembly language) of some impressive software for doing 
automatic file back-ups, so he has plenty of hands-on acquaintance 
with the relations between mechanisms and codes, “symbols” and re- 
alities, hardware and software. Like the computer scientist/meta- 
physician Brian Cantwell Smith, whose remarkable book On the 
Origin of Objects' would make a superb seminar mate with this vol- 
ume, he has seen that philosophers’ fantasies about computation 
and representation have not been innocent oversimplifications but 
major misleaders that must be replaced. 

The corrections of outlook Haugeland recommends are always 
presented in a constructive spirit. “The Intentionality All-Stars” is a 
frequently hilarious view of all the occupied positions on the topic of 
intentionality, laid out on a baseball diamond, and while he makes it 
very clear where his own sympathies lie, there are no idiots on the 
field of play. Haugeland’s pedagogical caricatures, unlike many in 
philosophy, bring out the best in each position. It turns out, for in- 
stance, that the main differences along the spectrum from first base 
(neo-Cartesian theory—for example, Jerry Fodor) and second base 
(neo-behaviorist—for example, Donald Davidson and me), to third 
base (neo-pragmatic theory—for example, John Dewey, Martin Hei- 
degger, Wilfrid Sellars, and Haugeland himself) can be seen to be 
due mainly to different empirical hunches about the relative impor- 
tance of factors admitted by all. For all its brilliance, this essay will 
not be readily understood by those without a substantial knowledge 


! Cambridge: MIT, 1996. 
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of baseball; the many allusions to the lore of the game really do a lot 
of work, and I know from experience that British philosophers of 
mind miss a lot of the wisdom. My prescription: philosophers every 
where now have a very good reason to add baseball to their stock of 
common knowledge. I also know from experience that my Ameri 
can undergraduates, who know the baseball, need help appreciating 
the essay’s glories because they do not have the requisite familiarity 
with the philosophical literature it positions. Same prescription: do 
your homework; it is worth it. 

Finally, I want to end this review with a specific compliment and a 
general complaint and call to arms. Some of the footnotes in this 
book are gems, deeply important, and not to be missed: for example, 
footnote 7 on page 44; footnote 10 on page 61; footnote 9 on pages 
164-65. Down with “endnotes”! There is no excuse, in this age of 
computer typesetting, for the “economy” of endnotes, either at the 
end of each chapter or at the end of the book. Few if any philoso- 
phy books need be read with only one hand, but why should we 
need three? I find that placing a Post-it on the edge of the current 
notes page provides a useful fingertab, and can be moved along as 
one goes, but this is still an annoying interruption. Authors, join the 
cause: ask publishers to put substantive footnotes at the bottom of 
each page, and all bibliographical material in some version of scien- 
tific format (Jones, 1994, p. 17)! Then philosophy books can aspire 
to be page turners in the good sense. 

DANIEL C. DENNETT 
Tufts University 
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MEASURING CONFIRMATION * 


hen belief comes to be seen as involving degrees of confi- 
W dence rather than simple yes-or-no judgments, the theory 

of rational belief must be adjusted accordingly. Proba- 
bilistic coherence stands out as an attractive analogue to the tradi 
tional desideratum of deductive consistency. And probability theory, 
as developed in Bayesian theories of rational belief, seems to go be- 
yond this. It provides a basis for conditionalization principles regu- 
lating change of belief—a topic about which traditional logic had 
little to say; and it offers a quantitative analysis of our notion of con- 
firmation, or evidential support. Confirmation, like belief itself, is 
something that obviously comes in degrees, but formal confirmation 
models based on deductive logic are merely qualitative. It seems 
that the relation these accounts seek to model must derive from a 
deeper, quantitative one; and Bayesians seem to be in a good posi- 
tion to provide one. According to the standard “positive relevance” 
account offered by Bayesians, a bit of evidence E confirms hypothesis 
H to the extent that it makes H more probable, in the sense that 
pr(H/E) > pr(f7). 

Now, this analysis cannot be thought to match our commonsense 
concept of evidence exactly. For one thing, the probabilistic rela- 
tion of positive relevance is symmetrical: E will confirm H on this 
analysis just in case H confirms E. But our ordinary concept of evi- 
dence involves a certain kind of epistemic asymmetry. Hypotheses 
are commonly taken to be supported by evidence, but not vice versa. 


* I would like to thank Jim Joyce, Mark Kaplan, Hilary Kornblith, Don Loeb, 
Patrick Maher, Derk Pereboom, and Lyle Zynda for helpful discussions, correspon- 
dence, and/or comments on earlier drafts Thanks also to the University of Ver- 
mont for research support. 
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We do not commonly say, in cases where E is evidence for H, that H 
is also (at least some) evidence for E. 

Moreover, our common-sense concept may well be indeterminate, 
in a way that will necessarily fail to match up with any precise formal 
model. Some have argued that the standard model, which is based 
on the difference between pr( H/E) and pr(H), should be replaced by 
a model based on the ratw of these quantities, or on the logarithm of 
this ratio.’ In support of such suggestions, it has been pointed out 
that, for instance, a bit of evidence that raised the probability of a hy- 
pothesis from 1/10* to 1/100 would intuitively seem much more mo- 
mentous than one that raised the probability of a hypothesis from 
.26 to .27.% On the ratio-based model, but not on the difference- 
based model, the former bit of evidence would be seen as more con- 
firmatory. But there are intuitions on the other side here, too. If we 
compared a bit of evidence that raised the probability of a hypothe- 
sis from .0001 to .001, it would likely strike us as much less powerful 
than a bit of evidence that raised a hypothesis from .1 to .9. This 
supports the difference model over the ratio model.’ What is a cop- 
firmation theorist to do? One might seek to settle the dispute in fa- 
vor of one of the two measures by supporting some of these 
intuitions and/or undermining others. Or one might reject both 
measures, and try to accommodate all the intuitions in some third 
measure. But, to my mind, it is not at all clear that this sort of intu- 
itive conflict should be seen as demanding formal solution. One 
may instead see it as a symptom of a relatively straightforward inde- 
terminateness of the ordinary concept of evidential support. 

Compare, for example, the notion of financial support. Suppose 
we are asked how we should measure the degree to which a candi- 
date C is supported by an interest group 7. One way of measunng 
this relation would be in terms of the number of dollars C received 
from I. But this measure would miss some important aspects of the 
situation. We might ask what proportion of C’s funds are supplied 
by J. Or, looking at the matter from a third angle, we might ask how 
crucial I’s support is for C (it might not be crucial even if Jis a high- 
dollars, high-percentage contributor, if C has sufficient other funds). 
Or we could use still another measure and ask what C’s position 


' For the former suggestion, see George N Schlesinger, “Measuring Degrees of 
Confirmaton,” Anais, Lv, 3 (1995). 208-12, for the latter, see Peter Milne, “A 
Bayesian Defence of Popperian Science?” Analysis, LV, 3 (1995): 213-15. 

* See Schiemnger, pp. 210-11 

* See Elliott Sober, “No Model, No Inference: A Bayesian Primer on the Gre 
Problem,” in Do Stalker, ed., Grue! The New Riddls of Induction (Chicago: Open 
Court, 1994), p. 228, where the example is credited to Ellery Eells. 
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would be if Z were the only contributor. Thinking about these differ- 
ent measures of support suggests to me that there is no single clear- 
cut question being asked when we ask ‘How much support does C 
get from J?’ It would not be surprising if the same were true of the 
question ‘How much does evidence E support hypothesis H ?’ 
Nevertheless, despite the fact that it fails to capture epistemic 
asymmetry, and despite the apparent indeterminateness of our 
pretheoretic notion, the positive-relevance model seems to be re- 
flecting something important. Surely, ‘making more probable’ cor- 
responds to something central in our notion of evidential support 
The model has been vigorously employed in Bayesian explanations 
of various tenets of scientific (and common-sense) epistemology. It 
would, I think, be surprising if some notion along the lines of post 
tive relevance were not at least an important component of our nat- 


ural notion of evidence.* 
I QUANTITATIVE CONFIRMATION AND OLD EVIDENCE 


Unfortunately, this model (along with the ratio- and log-of-ratio- 
based variants) seems to have a fatal flaw. The problem first emerges 
when considering cases in which pr(£) is taken to have the extreme 
value of 1. In those circumstances, E is treated probabilistically just 
as if it were a tautology. It cannot confirm anything, since pr(H/E) 
will be equal to pr(H}. Once one is absolutely certain about one’s 
evidence, it no longer is evidence.’ Unlike the asymmetry and inde- 
terminateness problems, the problem of old evidence threatens to vi- 
tiate the probabilistic approach to confirmation. 

Against this version of the problem, it may be protested—reason- 
ably enough, by common Bayesian standards—that we should never 
believe a nontautology with probability 1 anyway. But this response 
does not remove the problem’s sting. For it remains true that as 
pr(£) approaches 1, the degree to which E can confirm anything be- 
comes vanishingly small. The more confident we become in our evi- 
dence, the less it can be evidence. 

This quantitative version of the old-evidence problem is not re- 
stricted to the standard relevance measure. It also infects the ratio- 


t What the natural intuitive noton is may not be entirely clear. The core of the 
intuitive notion I wish to explore here takes confirmation as a relation of support 
between propositions which explains why confidence ın the supporting proposi- 
tion can help make rational confidence in the supported proposition, where the 
confidence involved comes in degrees. One might alternately see confirmation 
as most fundamentally aimed at explaining ratonal acceptance of propoantions. If 
one took this latter approach, the issues discussed in this paper might look quite 
different. See, for example, Isaac Levi, The Enterprise of Knowledge (Cambridge: 
MIT, 1980). 

* Credit for pointing out this problem goes to Clark Glymour, see his Theory and 
Evidence (Princeton: Unrversity Presa, 1980), pp 85-93 
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and log-of-ratio-based variants mentioned above. As John Earman 
points out, the problem also infects Haim Gaifman’s confirmation 
measure (l-pr(H))/(l-pr(H/E)).§ What these measures all have in 
common is that they ultimately root confirmation in the contrast be- 
tween pr(H) and pr(H/E). As pr(£) nears 1, this contrast disap- 
pears. 

A somewhat different probabilistic measure that has been champ? 
oned by I. J. Good’ is different in this respect; it is based on contrast- 
ing the likelkhoods pr(E/ H) and pr(E/~H). He calls the ratio of 
these quantities “the (Bayes) factor in favor of H provided by E” and 
the logarithm of this ratio “the weight of evidence in favor of H pro- 
vided by E.” The two likelihoods compared in Good’s measure do 
not converge as pr(£) approaches 1 in quite the same way as pr( A) 
and pr( H/E) do. Nevertheless, this measure, too, turns out to be in- 
fected by quantitative old-evidence difficulties. In cases where E con- 
firms H, the likelihood ratio ranges between 1 and %. But when 
pr(E) is high, and pr(#) is moderate—perhaps the paradigmatic sit- 
uation in which one discusses evidence—the likelihoods are forced 
to be so close that their ratio falls almost all the way toward the mini- 
mum end of this spectrum.® 

One response to this pervasive problem is simply to give up on 
finding a Bayesian analysis of our quantitative notion of confirmation. 
Suppose we discount the cases where pr(#)= 1, on the grounds that 
a model of ideal rationality need not account for cases where agents 
give probability 1 to nontautologies. Once we do this, we are left with 
an attractive analysis of qualitative confirmation, for pr( M/E) can sül 
be a tiny bit higher than pr(H} when pr(£) is very close to 1° But 

* Geifman, “On Inductive Support and Some Recent Tricks,” Erkenntnis, XXII 
(1985). 5-21; and Earman, Bayes or Bust? A Cnitcal Examination of Bayenan Confirma- 
tion Theory (Cambridge: MIT, 1992), p. 121, fn. 8. 

" See, for example, “Some Logic and History of Hypothesis Tes ” ın his Good 
Thinking: The Foundations of Probabthty and Its Apphcahons (Minneapolis: Minnesota 
UP, 1983). Thanks to Patrick Maher for prompting me to consider Good's mea 
sure. 

* For example, let pr(Z) be .99. If pr() is .75, the highest value the ratio can 
have is about 1.04. (To see this, note that in general, pr(£) = pr(H)pr(E/ H) + 
pr(~A) pr(£/~H). So in the present case, we get: .99 = .75pr(E/H) + 
25pr(k/~H). From this, ıt is clear that as pr(E/H) gets higher, pr(E/ ~H} gets 
lower, and that the ratio of pr(£/H) to pr(#/~#) will be greatest when pr(E/ H) 
is as large as possible-——that is, when it is 1. Assuming the ratio s thus maximized, 
we get .99 = .75 + .25pr(E/~H), so pr(E/~H) = .96 Thus, the maximum 
value the likelihood ratio can take when pr(E) = .99 and pr(H) = .75 is 1/.96, 
which is about 1.04.) The value is even lower if we suppose more uncertainty 
about H; if pr(H) is .6, the ratio maximum dro aade 108. And even if we 
then allow pr( F) to reach .9, the EEE GP age l. 

* For a nice defense of this position, see Mark Kaplan's Decinon Theory as Philoso- 
phy (New York. Cambridge, 1996), chapter 2. 
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many Bayesians have been reluctant to take this line. A literature has 
grown up around trying to solve the problem of old evidence. 

One of the accomplishments of this literature has been to identify 
different problems of old evidence. Some detailed taxonomies have 
been offered, but here I chiefly want to distinguish two species of 
problem. One may think of the problem, as Clark Glymour (op. at.) 
seems to have, as a problem for the Bayesian theory of change of be- 
lief. The problem here is that scientists seem to use evidence of 
which they have long been confident to rarse the probability of a hy 
pothesis. This can happen when the hypothesis is thought up at a 
point when the evidence is already known, or when someone sud- 
denly realizes that the hypothesis entails or explains the evidence. 
Most of the old-evidence literature is primarily focused on this sort 
of difficulty for the Bayesian account of belief change; let us call this 
the diachronic problem of old evidence. 

The other sort of old-evidence problem does not (at least directly) 
involve Bayesian accounts of belief change. ‘Confirmation’, after all, 
is not just the name of a kind of event. It also seems to refer to a cer- 
tain relation between propositions. Some propositions seem to help 
make it rational to believe other propositions. When our current 
confidence in E helps make rational our current confidence in H, we 
say that E confirms H. For such a relation to obtain, no event con- 
sisting of our becoming more confident in H on the basis of our con- 
fidence in E need be occurring. 

Indeed, the focus of traditional confirmation theory abstracted 
completely from particular believings. For the confirmation relation 
to hold between E and H, it was not required that E or H actually be 
believed at all. The confirmational relations were determined by 
logical relations; the connection to belief was just that, when E con- 
firmed H, belief in E would help make belief in H rational.” Now, 
contemporary Bayesians—at least those who have given up Rudolf 
Carnap’s hope of finding the objectively rational probabilistic rela- 
tions between propositions'——cannot so easily separate confirma- 
tional relations from relations involving an agent’s degrees of belief 
in propositions. Still, there seems to be a clear sense in which, given 
the beliefs of a particular agent, certain propositions support others 


* See, for example, Carl G. Hempel, “Studies m the Logic of Confirmation,” in 
his of Saentific Explanation (New York: Free Press, 1945), p. 24. 

't One who has not grven up on the objective approach is Maher. In “Subjective 
and Objective Confirmation,” Philosophy of Saence, XIN (1996): 149-74, he offers a 
highly contextualized, yet objective, probabilistic account (though of a purely qual- 
itative concept of confirmation). 
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to various degrees. The hope is that these confirmational relations 
(for a given agent at a given time) will be determined by the agent’s 
degrees of belief (which are, of course, logically constrained by the 
probability axioms). 

Unfortunately, analyzing this sort of confirmation in the standard 
Bayesian way yields the result that my high degree of confidence in E 
makes E incapable of lending significant rational support to any 
other belief. Let us call this the synchronic problem of old evidence.” 

U THE INTEREST OF THE SYNCHRONIC PROBLEM 

Various writers have tied the diachronic problem of old evidence to 
the fact that real agents are imperfect logicians. An agent who is 
confident of E may fail to see that it follows from H, and then, upon 
discovering the entailment, raise her confidence in H. This has led 
to one of the two main lines of approach to the old-evidence prob- 
lem. The standard notion of probabilistic coherence requires agents 
to have a certain sort of logical omniscience: they must give probabil- 
ity 1 to all logical truths, and respect all logical entailments perfectly 
(for example, by never having pr(P) > pr(Q) when P entails Q). 
Once this requirement is relaxed, we might devise ways of describing 
how learning that H entails E can—via probabilistic calculanon—raise 
an agent’s confidence in H.” 

This approach treats the old-evidence problem as a symptom of 
excessive idealization. Clearly, even the most epistemically perfect 
among us cannot approach logical omniscience. A theory that 
builds such omniscience centrally into its epistemic model may lose 
the ability to give a rational reconstruction of much human behavior 
that is surely epistemically praiseworthy. Thus, the project of making 
the Bayesian theory of rationality more realistic is clearly an impor- 
tant one. 


" My distinction between the diachronic and synchronic problems mirrors 
Daniel Garber's distinction between the Asstoncal and ahtstoncal problems, and Lyle 
Zynda's distinction between the problem of the confirmation eveni and that of the 

reaathon. It more roughly corresponds to Eells’s distinction between the 
problem of new old evidence, and the problems of old new evidence and old old 
edema. See Garber, “Old Evidence and Logical Omnuscience in Bayesian Confir- 
mation Theory,” in Earman, ed, Testing Saentific Theorws, Minnesota Studies tn the 
Philosophy of Saence, Volume X (Minneapols: Minnesota UP, 1983), pp 99-131; 
Zynda, “Old Evidence and New Theories,” Philosophical Studs, Lxxvit (1995): 67- 
95; and Eells, “Problems of Old Evidence,” Pactfic Philosophical Quarterty, LXVI 
(1985): 283-302. 

" See Garber; Richard Jeffrey, “Bayesianism with a Human Face,” in Testing So- 
entific Theres, and Ilkka Niiniluoto, “Novel Facts and Bayesianism,” Britsh Journal 
for ths Philosophy of Science, XXXIV (1988) 375-79. 
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Nevertheless, there is a sense in which the diachronic old- 
evidence problem, as a symptom of overidealization, leaves an im- 
portant aspect of the Bayesian model unscathed. It does not show 
Bayesianism defective as a certain kind of epistemic ideal, or model 
of rational perfection. It is quite plausible that a being who had un- 
limited intellectual powers, but limited data, would have a proba 
bilistically coherent confidence distribution over all possible 
hypotheses. For such an ideal being, the diachronic problem of old 
evidence simply would not arise. This observation at least limits the 
urgency of the diachronic old-evidence problem, if one is willing to 
allow for a good deal of idealization in thinking about epistemic 
norms. And in fact, the degree of idealization involved here should 
not be too shocking for those accustomed to thinking about ideally 
rational all-or-nothing beliefs in terms of the consistency and clo- 
sure conditions of deductive logic.” 

But the bite of the old-evidence problem goes deeper than reveal- 
ing the extent of idealization inherent in Bayesianism. This comes 
out in considering the synchronic version of the problem. Consider 
an agent who is not only currently logically omniscient, but who has 
always been so, and who has arrived at her present beliefs by faultless 
application of the idealized mechanisms of Bayesian belief revision. 
Even for such an ideal intellect, it seems clear that two things can be 
true simultaneously: (1) some beliefs will provide significant eviden- 
tial support for others; and (2) some of the evidential beliefs will be 
held with high levels of confidence. These two features are, as noted 
above, flatly incompatible on the standard Bayesian analysis of (syn- 
chronic) evidential support. Thus, no lapse from rational perfec- 
tion, even in the past, is needed for the synchronic old-evidence 
problem to arise. For this reason, the synchronic problem presents a 
particularly sharp challenge to the philosophical interest of the 
Bayesian program: it shows standard Bayesianism defective even as a 


'* It might be objected that one who was comfortable with a lot of idealization 
would have little reason to prefer Bayesianism over traditional logic in the first 
place. Is not probability theory itself basically a response to excessive idealization 
of the al-or-nothing model of belief? 

Such a comparison would be misleading. The problem with the bivalent model 
of behef 1s not just that ıt fails to describe actual imperfect human attitudes toward 
proposiuons Even a rationally ideal being would not have allornothing beliefs 
on contingent matters about which she was incompletely informed. An agent with 
this limited informanon who nevertheless harbored only bivalent beliefs would be 
less epistemically perfect, not just less realistic, than one whose confidence in such 
propositions came in degrees. Since having only allor-nothing behefs based on m 
complete information is not a rational ideal, the move to Bayesianism is not one of 
trading off idealization for realism, but one of improving the idealization. 
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model of idealized rationality. In what follows, then, I shall focus on 
the synchronic problem. 

Il. CAN THE SYNCHRONIC PROBLEM BE REDUCED TO THE DIACHRONIC? 
One reason that interest has centered around the diachronic prob- 
lem may be based on the thought that a solution to the diachronic 
problem would provide a solution to the synchronic one. Suppose 
that we had in hand a satisfactory account of confirmation events— 
events in which probabilities of hypotheses are raised by evidence. A 
given agent's body of belief will be the result, in part, of a history of 
such events. Given such a history, Ellery Eells” writes that “it seems 
appropriate to say that E ts (actual) evidence for T, for a given individ- 
ual, if, at some time in the past, the event of its confirming T, for 
that individual, took place” (op. cdt., p. 287). Earman" concurs with 
this approach, which I shall call the historical approach to the syn- 
chronic problem. 

The historical approach clearly gives up on the idea that confirma- 
tional relations (for a given agent at a given time) are determined by 
the agent's present degrees of belief. Still, there is something attrac- 
tive about the idea. And if it were successful, the synchronic prob- 
lem of old evidence would be reduced to the diachronic problem. 
In that case, the entire old-evidence problem might seem to stem 
from overidealization. 

It turns out, however, that the historical approach overestimates 
the connection between the two concepts of confirmation. To see 
this, consider an agent who is wondering whether deer live in a 
nearby wood. He comes across a pile of deer droppings, and his 
confidence in the deer hypothesis increases to near 1. Shortly there- 
after, he finds a shed antler. Since his confidence in the deer hy- 
pothesis is already so high, this new evidence does not have any 
significant impact on it. Now, subsequent to the agent’s finding the 
second piece of evidence, let us ask whether the evidence about the 
droppings and the antler provide rational support for our agent’s 
belief about deer. Intuitively, the droppings and the antler provide 
equally strong rational support for the agent’s deer belief. There is, 
I think, no sense in which the droppings currently provide stronger 
support or better evidence than does the antler. But only the drop- 
pings are historically associated with a significant increase in the 


» Eells distinguishes ‘confirming’ from ing evidence for’, but I shall use the 
terms interchangeably. His analysis is intended to capture the sense in which E 
supports H after Eis known. 

“ Bayes or Bust?, chapter 5. 
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agent’s probability for deer. The historical approach thus makes 
contemporary evidential support depend in an unintuitive way on 
the order in which evidence was discovered.” Intuitively, synchronic 
support should depend on the agent’s present epistemic state, not 
on such historical accidents. Clearly, the synchronic problem cam 


not be reduced to the diachronic one in this way.” 
IV. THE COUNTERFACTUAL APPROACH 


The most immediately appealing, and most widely criticized, direct 
approach to the synchronic problem relies on counterfactual de- 
grees of belief. For an agent who knows E, this approach would ana 
lyze confirmation with the standard positive-relevance definition; 
however, it would be applied not to the agent’s actual probabilities, 
but to what those probabilities would have been in circumstances 
where the agent did not know E.” 

This approach, like the historical approach, would abandon see- 
ing confirmation relations as determined by an agent’s probabilities. 
Stll, the intention seems to be to make confirmation relative to a 
probability function that is in a sense as close to the agent’s as possi 
ble, consistent with the agent’s not knowing £. 


7 I make this argument briefly in a review of Earman’s Bayes or Bust? in Philosoph- 
ical Review, CO (1994): 345-47. Maher argues for the same conchiwon using a 
somewhat different type of example. In his example, the bearing of the evidence 
on a hypothesis changes as background knowledge changes. In such situations, a 
bit of evidence that intuitively confirms a certain hypothesis today might have dis- 
confirmed it when it was first discovered. 

* One might object that this sort of problem need not trouble a historical analy- 
sis of quaktaive confirmation. (Earman is clearly interested in a Bayesian analysis 
of the quantitative notion, but Eells may have intended his argument to apply only 
to a qualitative notion ) The pomt would be that finding the antler presumably 
would have raised the probability of deer at least a tiny bit, given that the probabil 
ity for deer would always remain a bit shy of unity. 

But as mentioned above, it is not obvious that the old-evidence problem arises at 
all, if we seek only a qualitative analysis of confirmation and preclude complete 
confidence in contingent matters. In this case, it is not clear that there is any rea 
son to substitute the historical account for the standard, nonhistorical analysis. 

On the other hand, if we do allow agents to believe contingent matters with 
probability 1, then it would be hard to see why the deer hypothesis could not attain 
this level. And if it did, then the historical account of even qualitative confirma 
tion would fail in just the manner described in the text. Evidence that was discov 
ered subsequent to the hypothesis’ being given probability 1 could not increase its 
probability. Such evidence might nevertheless provide just as much rational sup- 
port for the hypothesis as would the evidence that historically helped raise its prob- 
ability. 

Finally, the sort of example given by Maher would tell against qualitative versions 
of the historical account even if we did not allow full belief in contingent propost 
tions 

* This approach is tentatively endorsed in Garber and embraced more whole- 
heartedly in Colin Howson, “Bayexianism and Support by Novel Facts,” British Jour 
nal for the Philosophy of Science, xxxv (1984): 245-51. 
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But will the counterfactual move allow us to capture what we want 
to capture from the agent’s actual probability function? Various 
writers” have noticed problems here. For one, it is not clear that 
the counterfactual stipulation determines any unique probability 
function. Moreover, in certain cases, the distribution that the agent 
would have in the counterfactual situation would be produced by ir- 
rational factors that would accompany the nonbelief in E2; in such 
cases, the counterfactual degrees of belief give the wrong result.” 
These sorts of cases highlight the problem of getting too far away 
from the agent’s real probabilities that were supposed to ground 
the confirmation relation. The factors that supplement the agent’s 
probabilities in determining the truth of the counterfactuals are of- 
ten irrelevant to the agent’s epistemic state. 

IV. DOES JEFFREY CONDITIONING UNDERMINE THE SYNCHRONIC NOTION? 
Another approach to the synchronic problem is to argue that, from 
the Bayesian point of view, the whole project of analyzing synchronic 
confirmation should be abandoned. Lyle Zynda (op. cit.) argues 
that, once we see the Bayesian position as allowing belief updating 
by Jeffrey conditioning as well as classical conditionalization, we can 
see that the whole Bayesian project of analyzing synchronic confir- 
mation falls apart—-and for reasons other than the old-evidence 
problem.” 

The argument begins by considering cases where pr(£) is moder- 
ate, so the old-evidence problem does not arise. For such cases, 
Zynda suggests that we should see E as confirming H only if the fol- 
lowing condition holds: for any probability function PR that comes 
from the present probability function pr by Jeffrey conditioning init- 
ated by a set of probability changes which includes raising the proba- 
bility of E PR(A) > pr(H). 

Zynda shows that this condition is not acceptable. An instance of 
Jeffrey conditioning may be initiated by changes in the probabilities 
of several propositions. Even when £ is intuitively confirmatory, an 
increase in E’s probability may be accompanied by decrease in prob- 
ability of an even-more-strongly confirmatory proposition F, resulting 
in a decrease of probability for H overall. Zynda considers modify- 


» See, for example, Glymour, Eells; Charles S. Chihara “Some Problems for 
Bayesian Confirmation Theory,” British Journal for the Philosophy of Scrence, XXXVO 
(1987): 551-60; and Earman, Bams or Bust? 

™ See Maher. 

™ The classical conditionalization principle changes a probabihty function in re- 
sponse to an evidential proposition’s being raised to 1. Jeffrey conditioning 
changes a probability function in response to initiating changes in the probabilities 
of one or more propositions. These initiating changes may be in either direction, 
and may be to nonextreme probability values. 
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ing the condition to specify that no other positively relevant proposi- 
tion is decreased in probability; however, he shows by example that 
even this modified condition excludes some cases in which E intr 
itively confirms H. 

Zynda concludes that “Jeffrey conditioning introduces a new, 
quite radical kind of contextualism: the evidential value a given 
proposition has cannot be evaluated independently of what is hap- 
pening in other regions of logical space” (op. cit, p. 77). 


Judgments of evidential support have to be relativized to particular ways 
of updating, i.e. to particular epistemic events. Once we have a particu- 
lar epistemic event in mind...judgments of evidential support make 
sense.... The quite different project of coming up with an analysis of “e 
confirms A for S at t” is not in and of itself a very fruitful one for 
Bayesians to pursue (op. dt, p. 79). 


If this argument is correct, we should give up the project of analyz- 
ing synchronic confirmation relations altogether. Once we admit 
Jeffrey conditioning, the evidential value of propositions is contextu- 
alized in a way that precludes our meaningfully asking in general 
whether E confirms H for Sat t. 

It seems to me, however, that the project of analyzing the syn- 
chronic notion of confirmation should not be dismissed this quickly. 
Recall that, in response to the first difficulty, Zynda modified his 
condition to exclude cases involving a certain kind of interference— 
cases where the probability of some other positively relevant propost 
tion Fwas reduced while E’s probability was raised. But the example 
to show that the modified condition was still unacceptable involved 
the interfering effects of another evidential proposition—a nega- 
tively relevant proposition whose probability was raised. This sug 
gests a further modification, in which the condition would also 
specify that this second sort of interference not occur. The basic 
idea here would be that, in Jeffrey conditioning, raising the probabil 
ity of a confirming proposition E will, ceteris paribus, raise the prob- 
ability of H. The ceteris paribus condition should naturally preclude 
both simultaneous reduction of probability for other confirming 
propositions, and simultaneous increase in probability for discon 
firming propositions. 

Thus remodified, the condition avoids the difficulties that faced its 
predecessors. So the fact that Jeffrey conditioning can involve mult 
ple evidential propositions simultaneously does not, after all, render 
the evidential effect of any one of these propositions contextualized 
in a way that would prevent us from asking meaningfully whether a 
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given evidential proposition confirms or disconfirms a given hypoth- 
esis (for a given agent at a given time). Indeed, it seems that allow 
ing Jeffrey conditioning leaves us in much the same position with 
respect to the synchronic notion of confirmation.” 
VL A SOLUTION TO THE SYNCHRONIC PROBLEM? 

Why does increasing pr(£) drain E of confirmatory power, as mea- 
sured by the standard account? The standard account measures con- 
firmation by how much increasing pr(E) to 1 would raise pr(H). 
This partly realizes the traditional confirmation-theoretic idea that 
E's confirming H does not require E to be believed. But, curiously, it 
does not fully accord with the idea that our current degree of belief 
in È is irrelevant to the question of how much E confirms or discon- 
firms H. Consider a situation in which an agent’s beliefs change by 
her becoming increasingly confident in E As pr(F) increases, 
pr(H) increases to take account of the increased confidence in & 
Thus, the probability of H on the assumption that E is true, and the 
probability of H all things considered, become closer. In a sense, E 
just becomes part of “all things considered.” At this point, becoming 
more certain of E cannot raise the probability of H much more, 
since there just is not room to become much more confident in E 
In such circumstances, the standard measure shows very little eviden- 
tial support. Yet intuitively, the agent’s confidence in Eis already an 
important support for her present confidence in H. This dimen- 
sion—we might call it the dimension of actualized support, as opposed 
to potential further support—is simply ignored by the standard mea- 
sure. This is the problem of old evidence. 

From this perspective, we can also see why the traditional mea- 
sure suffers from what might be called the probable-hypotheses problem. 
This is a difficulty that is very similar to the old-evidence problem, 
but much less celebrated.“ Consider cases where pr( H) gets very 
high. Again, there is in such cases little room for E (any E) to raise 
pr(H) further. The traditional measure, taking into account only 
potential further support, reflects this fact; once pr(/) is close to 1, 
no evidence will be counted as providing a significant level of sup- 


” This impression is reinforced by the observation that the remodified condi- 
tion will be satsfied whenever the standard positiverelevance condition is satisfied. 
My hunch to this effect was proved by Zynda (in correspondence; I should also 
note that he expressed his current reservations about the strong claim crincized 
above). A related theorem is proved by Carl G. Wagner in “Old Evidence and New 
Explanation,” Philosophy of Science, LXIV (1997): 677-91. 

™ This problem—or at least the hearts vernon that occurs when pr(H)=1— 
is briefly celebrated by Kaplan (p. 50), who notes its similarity to the old-evidence 
problem. 
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port for H. Oddly, once we become quite confident in a hypothesis, 
we are counted as having no significant evidential basis for our con- 
fidence! 

It seems crucial, then, that our measure of confirmation go be- 
yond measuring potential further support. The question then arises 
whether a dimension of actualized support is somehow reflected in 
the agent’s current probability function. Intuitively, the standard 
measure reflects the amount that pr(H) would rationally increase 
just as a result of the agent raising her pr(E) to 1. This potential in- 
crease in pr(H) might be thought of as resulting from two factors: 
the degree to which Fis linked to H, and the distance E has to travel 
in attaining probability 1. What we are interested in is the former 
factor, not the latter. 

How might we control for the latter factors’ influence? The 
most obvious way would be to normalize in a way that corrects for 
it. For example, if pr(£) is already .75, so that it can be raised by 
only 1/4 of the distance between 0 and 1, we can multiply the po- 
tential increase in pr(H) by 4. Lf pr(£) is already .99, so that it can 
be raised by only 1/100 of the distance between 0 and 1, we would 
multiply the potential increase in pr(H) by 100. It turns out that 
dividing the standard measure by pr(~£) achieves just this result. 
This suggests using the following as a measure of confirmational 
support: 

SEH) = [pr(H/E) — pr(H)]/pr(~E) 


Now, one might immediately object to this measure that, when 
pr(£)=I], it is undefined. But as noted above, there are good 
reasons to hold that contingent propositions should never be 
given probability 1. Thus, we may stipulate that the definition— 
which is, after all, part of an account of ideal reasoning—apply 
only when O<pr(E)<1l. This tack is especially appealing if one is 
trying to analyze confirmation in terms of standard probabilistic 
relations among an agent’s beliefs, since from this point of view, 
propositions having probability 1 are indistinguishable from tau- 
tologies. This is important because it strongly suggests that on a 
standard probability-based approach, it would be hopeless to try 
to measure some interesting kind of evidential support from 
such propositions. 

Interestingly, there is another intuitive route to essentially the 
same measure. We want to capture the support an agent’s confi- 
dence in H already receives from E (in contrast to the potential fur- 
ther support that might be gotten from raising pr(£)). We might 
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measure this by comparing the agent’s present probability for H— 
which includes that present support—with her probability for H on 
the condition that E is not true: pr(H) — pr(H/ ~E). Adding this di- 
mension of actualized support to the traditional measure of poten- 
tial further support yields: 


S*(E,A) = pr(H/E)— pr(H/ ~E) 


as a measure combining both sorts of support. If conditional proba- 
bilities are defined in the standard way, S* is also undefined when 
pr(£)=1. And in cases where 0<pr(£)<1, measure S* turns out to 
be equivalent to measure S.” 

S* is briefly considered, and rejected (for reasons that will be ex- 
amined below) by Earman.” His reason for considering it is not ei- 
ther of the intuitive arguments given above. Rather, he points out 
that conditional probabilities can be defined in a (nonstandard) 
way which allows pr(H/ ~E) to be defined when pr(£)=1. If we 
do this, then S*, unlike the standard positive-relevance measure, 
can take positive values when pr(£)=1. But the measure has other 
advantages, in addition to the intuitive naturalness explained 
above, which strike me as more important. Since the remaining 
discussion will not turn on interpreting conditional probabilities 
nonstandardly, I shall treat S* as an alternate formulation of S, and 


pr(H/E) — pr(H) 


S(E, H} = 
pr(~E) 


Expanding pr(H), we get 
pr(H/E) — [pr(H/E)pr(E) + pr(H/~E)pr(~£)] 


pr(~F) 
Regrouping the numerator gives: 
[pr(H/E) — pr(H/E)pr(E)] — pr(H/~E)pr(~E) 
pr(~£) 


Factoring pr( H/E) out of the left ade of the numerator yields. 
pr(H/E)(1 — pr(E)] — pr(H/~E)pr(~E) 
pr(~£) 
Since [1 — pr(#)] = pr(~£), this may be canceled out, leaving: 
pr(H/E) — pr(H/~E) = S*(E, H) 


™ Bayes or Bust?, pp. 120-21. 
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shall use whichever formulation is best suited to the purpose at 
hand.” 

As Earman notes, S agrees qualitatively with the standard measure 
in cases involving nonextreme probabilities. (This can be seen most 
easily by considering the first formulation given above. Since S sim- 
ply divides the standard measure by a positive number (pr(~£)), 
positive (negative) confirmation will remain positive (negative).) 
Given that the standard measure is qualitatively plausible in cases in- 
volving nonextreme probabilities, this is an advantage. 

More impressive, however, is S’s quantitative treatment of old evi 
dence. The most troubling version of the old-evidence problem occurs 
when pr(E) is high, but still short of unity. Unlike the standard mea- 
sure, S allows significant confirmation even when pr(£) is, say, .999999. 
This is an enormous advantage. It also suggests that $ may come closer 
than the standard measure to satisfying the mtuition that the degree to 
which E confirms H should not depend on how confident we are of E 

In fact, $ is, in an intuitively appealing way, insensitive to pr(£). 
Suppose an agent has a probability function relative to which the 
support E gives to H as measured by S (let us call this S-support) has 
some particular value. Let us change the agent’s probabilities by Jef- 
frey conditioning initiated by raising (or lowering) just the probabili- 
ties for E and ~E It would be nice if, relative to the agent’s new 
probability function, the Ssupport E gives to H retained the same 
value. In fact, this is the case: in general, S- support given by Eis sta- 
ble over Jeffrey conditioning on {F ~E}." 

As an illustration, consider a rational agent confronted by a stan- 
dard die he believes to be fair, which has been rolled, but is out of 
sight. Let O be the proposition that an odd number came up, and L 
be the proposition that a low number (1-3) came up. For such an 


V As this paper was under consideration, Jim Joyce sent me a manuscript chapter 
of his forthcoming book, in which he supports a version of measure S* as capturing 
one concept of evidential relevance. Joyce would apply the measure even m cases 
where pr(E)=1, by defining it via ReyniPopper measures, which work much like stan- 
dard conditional probabilites, but define pr(H/~E) even in cases where pr(~E) =0. 
Joyce interprets such measures as describing the epistemic state of someone who u 
fully certam that E is true, but soll has opinions as to the probabulty of H if she is 
wrong about E Joyce's discusson is highly recommended. While ıt has some paral 
lels with the present discussion, it has interesting differences and reaches different 
conchimons, for example, he sees the standard confirmaton measure as capturing a 
different, but no less legitimate, concept of evidennal relevance. See Joyce’s The Foun- 
dations of Causal Deason Theory (New York: Cambridge, 1999), chapter 6. 

~ This is easy to see in the $* formulation. The conditional probabilities by 
which the measure 1s defined are unchanged in Jeffrey conditioning, since Jeffrey 
conditioning involves rigidity of all probabilities conditional on the members of 
the relevant partition. 
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agent, pr(O)=1/2 and pr(L)=1/2. Using the second formulation 
given above, S(L,0O) = pr(O/L) — pr(O/~L) = 2/3 — 1/3 = 1/3: 
a modest degree of positive support. 

Now suppose the agent raises the probability of L to .99 by Jeffrey 
conditioning in which L and ~L are the only propositions directly 
affected by experience. Her new probability PR for O will, of course, 
be significantly higher: 

PR(O) = .99pr(O/L) + .Olpr(O/~L) = .66 + .00333... = .66333... 


This higher probability for O intuitively incorporates the agent’s in- 
creased confidence in L. It also renders support as standardly mea- 
sured almost nonexistent: PR(O/L) — PR(O) = .00333.... This isa 
prime example of the synchronic old-evidence problem. The low 
support figure represents the potential for becoming more certain 
of O on the basis of becoming even more certain of L, but leaves out 
the extent to which PR(O) already incorporates support from confi- 
dence in L. 

By contrast, S-support is unchanged at 1/3. Thinking in terms 
of the first formulation, it is easy to see how this figure represents 
the standard measure of support normalized by being multiplied 
by 100, since the agent’s move to full belief in L would move 
PR(L) 1/100 of the way from 0 to 1. Intuitively, it seems that S$ 
counts both potential and actualized support. Cases like this sug- 
gest that $ offers a possible solution to the synchronic problem of 


old evidence.” 
VIL THE OLD-EVIDENCE PROBLEM RETURNS? 


-* As noted above, Earman quickly rejects S* as a suitable measure of 
confirmation. His reason is that, in cases where H entails E and 


» It may be of interest to see how this sort of situation is handled by the 
likelihood-ratio-based model mentioned above. The model would measure 
confirmation in the dice case by contrasting the likelihoods pr{(L/O) and 
pr(L/~9O). In the initial situanon, these quantities are 2/3 and 1/3 respec- 
tively, so the likelihood rato is 2. But after the probability of L is raised to 
.99, the likelihoods take on new values. In particular (using values from the 
text). 


PR(L):PR(O/L) 99- (2/8 
prayo- PRU PR/E) | 99-(2/8) _ 5g, 
PR(O) 66333... 66883... 
PR(L): PR(~O/L .99 - (1/3 35 
PRL /~0) = TREY PRI~O/E) | 90/9 aii 
PR(~0) 33666... 83668... 


Thus the Kkelihood ratio shrinks to about .99/.98 = 1 01. 
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pr(E)=1, S*(EA) is just equal to pr(H). Thus, for any hypothesis 
H, all evidence which is both (1) entailed by Hand (2) believed with 
certainty is counted as confirming H to the same degree. 

Now, this problem as posed by Earman might not seem to concern 
us if we intend our measure to be restricted to cases where pr(£)<1. 
But there is a closely related phenomenon that affects even these 
cases. It derives in part from the fact that, in cases where H F E 
S(EA) = pr(H/E) (this is because pr(H/~E) = 0). This fact alone 
does not make all entailed evidence equal, however. For example, 
consider our agent who has ordinary expectations about a fair um 
seen die, and who thus has the standard simple and conditional 
probabilities for the following propositions: 


E2: the die shows a multiple of two. 
E3: the die shows a multiple of three. 
H6: the die shows sıx. 


Both £2 and E3 are entailed by H6. In this situation, both should 
confirm H6, but £3 should confirm it more strongly. And measure 
S gives just this result: S(E2,H6) = pr(H6/E2) = 1/3, while 
S(E3,H6) = pr(H6/E3) = 1/2. 

So far, we have not seen the difficulty. But, as we know from the 
old-evidence problem, as pr(£) approaches 1, pr(H/E) approaches 
pr(H). Thus, if the above agent's probabilities for both E2 and E3 
are raised to nearly 1, the probability of H6 conditional on each of 
these propositions will be very close to the unconditional probability 
of H6, and hence the support the two propositions give H6 will, ac- 
cording to S, be very nearly the same. This is a disturbing result, for 
intuitively, £3 should give more confirmation than E2, even when 
both are highly probable. Does this not show that S encounters the 
old-evidence problem after all, at least for cases where HHE? 

Closer inspection reveals, I think, that the answer to this question 
is “no.” This is not the problem of old evidence. To see this, let us 
begin by considering what might seem an inconsistency between the 
claims made above about S. 

Immediately above, we saw that, in a situation involving ordinary 
beliefs about a hidden die, S assigns E2 and E3 robustly different 
degrees of support for H6, while if E2 and E3 both become very 
highly probable, S will have to assign them very nearly the same de- 
gree of support for H6. In the previous section, however, we saw 
that Sis, in a very natural sense, insensitive to pr(#). If this is true, 
how could raising pr(£2) and pr(£3) erase the difference in the S- 
support they give H6? 
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The first thing to notice is that the previous section’s claim does 
hold in the present example. If we begin with ordinary beliefs about 
a hidden die, and increase belief in, for example, E2 by Jeffrey up- 
dating on just E2 and its negation, S(E2,H6) does not change, 
even when pr(£2) gets very high. The probabilities in such a situa- 
tion will look like this (using PR for the post-updating beliefs, and pr 
for the initial beliefs): 


PR(E2) = .99 
S(E2,H6)= PR(H6/E2) = pr(H6/E2) = 1/3 


This shows that S(£2,H6) remains unchanged. But other things 
have changed. For one thing, PR(H6) now reflects the positive evi 
dence from E2: 


PR(H6) = .99pr(H6/E2)+.0lpr(H6/~E2) = .99(1/3)+0 = .33 


More interestingly, some probabilities conditional on £3 have 
changed. The new probability for E3 itself is still 1/3, since E2 is 
probabilistically irrelevant to £3. But: 


PR(H6/E3) = PR(H6 & £3)/PR(E3) = PR(H6)/PR(E3) = 
.38/(1/3) = .99 


Because of this, the support £3 gives to H6, according to measure S, 
is now close to maximal: 


S(£3,H6) = PR(H6/E3) = .99 


So, while raising the probability of E2 did not change its confirma- 
tory power for H6, it dramatically increased the confirmatory power 
of E3. Raising the probability of one evidential proposition can thus 
affect confirmation relations involving other propositions. So the 
fact that S (E, H) is insensitive to simply raising the probability of E is 
quite consistent with the fact that raising the probability of two or 
more evidential propositions will disturb the Sconfirmation rela- 
tions involving all of them. 

This example also shows that the disturbing fact—that raising the 
probabilities of both E2 and £3 would erase the differences in con- 
firmatory power between them—is not simply an instance of the old- 
evidence problem. For either one of these same propositions can be 
believed with near certainty without any effect on its confirmatory 
power. Still, while measure S thus seems to avoid the old-evidence 
problem, it would be premature to conclude that it solves the prob- 
lem. Presumably, a solution to the problem would require a mea- 
sure which not only avoided it, but which was otherwise reasonable. 
How unreasonable are the verdicts S delivers in the disturbing cases? 
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VII A DIFFERENT PROBLEM FOR BAYESIANISM 

Let us begin by looking more carefully at the problematic case men- 
tioned above. Suppose that an agent starts with ordinary beliefs 
about a hidden die, and is told first by one highly trusted source that 
it shows a multiple of two (£2), and then by another that it shows a 
multiple of three (£3). Intuitively, £3 would seem more important 
than £2 in supporting her now-high confidence that (H6) the die is 
showing a six. But measure S must now give them almost equal con- 
firmatory power. In particular, the confirmatory power of both E2 
and £3 is nearly maximal in this case.” Raising the probability of £2 
raised the confirmatory power of #3, and vice versa, to the extent 
that the difference between them was nearly erased. 

One important question to be asked about this example is about 
exactly where the counterintuitiveness lies. The fact that £2 and £3 
have nearly equal confirmatory power derives from another fact: that 
each of them has confirmatory power close to the maximum of 1. 
But this seems to be a problematic fact about each one of the eviden- 
tial propositions, independent of questions about differences in confir- 
matory power. (If it was intuitively reasonable that raising pr(#2) 
would make £3 nearly maximally confirmatory, and vice versa, then 
presumably it would be reasonable for them to have approximately 
equal confirmatory power in that situation.) If this is right, then the 
real root of the problem should reveal itself midway through the ex- 
ample story, when the agent comes to believe strongly in £2, render- 
ing E3 almost maximally confirmatory. What, intuitively, is going on 
at this pointe 

Recall that the way learning E2 raises the confirmatory power of 
E3 so dramatically is something like this: once pr(E2) becomes 
close to 1, pr(H6/E3) gets close to 1. Since S( £3,H6) is in this case 
equivalent to pr(H6/£3), it also rises to nearly 1. Or, more infor- 
mally, what happens is that when it is almost certain that a multiple 
of two is showing, the information that a multiple of three is showing 
is virtually equivalent to the information that a six is showing. Our 
confirmation measure reflects the strength of this connection. 

Why should this sharp increase in confirmatory power be counter- 
intuitive? I think it is useful to compare this case to another. Con- 
sider a second agent, who starts off with beliefs appropriate to a fair 
die having only three (curved) sides, bearing a two, a four, and a six, 
respectively. For such an agent, it would intuitively be entirely rea- 
sonable to take £3 as maximally confirmatory. After all, with these 


* Conuder E2: pr(H6/E2) w almost 1, ance it 1s higher than pr( H6), which it- 
self is near 1. And pr(H6/~£2) 1 0, because H6+E2. Thus §(£2,H6) ıs almost 1. 
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background beliefs, £3 is virtually equivalent to H6. But now let us 
ask what the difference is between our two agents. From the point 
of view of their probabilities, the answer seems to be “not much!” 
For both, pr(£2) would be nearly 1, pr(£3) and pr(#6) close to 
1/3, and pr(H6/E3) nearly 1. For both, learning £3 would virtually 
guarantee H6. Why, then, is there such a difference in our intu- 
itions about confirmatory power in these two cases? 

The answer, it seems to me, is related to the following intuition: in 
the case with the ordinary die, if £3 is learned, it will raise pr( H6) to 
near 1, but it will be doing only part of the work of raising it from its 
baseline level (1/6), the rest being done by £2. In the three-sided 
die case, if E3 is learned, it will be doing all of the work in 
the probability of H6 from its different baseline level (1/3) to near 
certainty. 

When we think about the evidential power of £3 in the case with 
the ordinary die, we seem to assess it against 6's intuitive base-line 
level. We intuitively take the elimination of the odd-numbered pos- 
sibilities as something that was accomplished by another piece of evi- 
dence. When we think about the three-sided die case, by contrast, 
we take the absence of odd-numbered possibilities as being simply 
part of the given background conditions for our reasoning. This 
suggests that the intuitive difference between the cases depends on a 
factor to which our confirmation measure is insensitive: the distinc- 
tion between specific evidence other than E, and background as- 
sumptions. 

This point constitutes a further departure from the worry ex- 
pressed by Earman. He rejected a version of measure S because it 
equalized confirmatory power for all evidence that was both entailed 
by the hypothesis and had probability 1. We have already seen that 
essentially the same difficulty occurs at evidential probabilities below 
1. Moreover, in one such case, it seems that unintuitive equalization 
per se is not the basic problem—-rather, it is a symptom of unintu- 
itiveness in measure S’s verdicts on individual items of evidence. 
Now, if the essential problem is not about equalizing differences in 
these cases, but rather about a very general feature of confirma- 
tion—the background/evidence distinction—then we might suspect 
that the problem should occur even in situations where H does not 
entail E 

It seems to me that the fundamental problem is, in fact, quite in- 
dependent of H entailing E. To see this, consider the following vari- 
ant of our deer example: the agent is wondering whether deer live in 
a certain wood that he has not yet explored. Let H be ‘Deer live in 
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this wood’, D be ‘There are deer droppings at location x’, and A be 
“There is a shed deer antler at location y’, where x and y are some 
narrowly circumscribed locations in the wood. This case is in a sense 
diametrically opposite from the dice case, in that the evidential 
propositions are not even “close to” being entailed by H, (Since D 
and A specify particular locations for the droppings and antler, their 
probabilities even conditional on Hare not very high.) 

Now suppose the agent starts with pr(H)=.5, pr(D)=.001, and 
pr(A)=.0001. Both D and A will confirm H fairly strongly—at about 
5." Suppose, however, that the agent then finds what are almost 
certainly deer droppings at location x, sending the probabilities for 
D, and hence H, very high. This does not, of course, change D's 
level of S-support for H. This is good news. In fact, it not only 
shows, like the previous example, that S allows old evidence to con- 
firm hypotheses strongly. It also shows that S allows instances where 
E strongly confirms H even when pr(H) is very high. Thus, S does 
not seem to suffer from what we called above the problem of proba- 
ble hypotheses—the inability of the standard measure to assign sig- 
nificant degrees of confirmation to highly probable hypotheses. 

Unfortunately, not all the news is good. In particular, although S- 
support from D is unchanged, S-support from A suddenly becomes 
very low. This is because H is now very likely irrespective of A, so 
pr(H/A) and pr(H/~A) are nearly the same. This is quite unintu- 
itive; A seems just as good a sign of deer as it was before D became 
highly probable. Moreover, once the agent finds the antler and A 
becomes very probable, even S(D,H) becomes very low. Here, we 
have an even more counterintuitive case: the agent has quite reason- 
ably come to have high confidence in H, based on learning A and D. 
But according to measure S, neither A nor D now gives significant 
support to H (though, oddly, their disjunction does)!" 

This seems, at first blush, to be much like the original problem of 
probable hypotheses. But it is certainly not precisely the same prob- 
lem; we have seen that $ does not automatically preclude strong con- 


E In this situation, the probabilities for H conditonal on ~D and on ~A will be 
close to the unconditional probability of H—around 5—conforming with the intu- 
ition that the fact that there were not, for example, droppings at location x would 
have little effect on the probability of deer living in the wood. But the probabihties 
for H conditional on A and on D will be close to 1. Subtracting the former condi 
tional probabilines from the latter thus leaves $-support at about .5 in each case. 

= It is worth noting that this problem is different than that described above for 
the historical approach. There, after both pieces of evidence were discovered, 
their evidential weights depended on an irrelevant factor: the order of discovery 
The problem here is different, though no less real. 
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firmation of highly probable hypotheses. What, intuitively, is going 
wrong? 

Again, I think that it will be helpful to look at the example halfway 
through the story. Let us examine the first counterintuitive result— 
that the confirmatory power of the antler is drained when discovery 
of the droppings raises the agent’s probability for deer. At this 
point, the low value measure S assigns to A reflects the fact that, gruven 
the agent's probabilities, A’s truth or falsity is not highly relevant to H. 
But if A is not highly relevant to H, given the agent’s probabilities, 
why do we have the intuition that A is highly confirmatory? It seems 
to me that, when we think about the question of A’s evidential bear- 
ing on H, we are impressed by the fact that, given the agent’s general 
background beliefs about the relations between deer, antlers, and 
droppings, we can see that, puting aside the agent's specific evidence D, A 
would be highly relevant to H. The intuitive idea—admittedly a 
vague one—is that the agent’s degree of confidence in A would 
make a big difference to his confidence in H ceteris paribus, where 
the ceteris paribus condition would hold the agent’s general back- 
ground beliefs constant, but abstract away from the agent’s beliefs 
that are being thought of as his specific evidence about H. The 
question seems to be something like ‘How important would belief in 
A be, if it were the agent’s only evidence about H’?* 


™ Maher’s account of confirmation involves various contextual factors, among 
which u a specification of a set of evidential propositions “whose relevance to H is 
under discussion or otherwise salient in the context” (op. at, p. 167). But Maher’s 
approach differs markedly from the present one ın a number of ways. First, his ac- 
count is purely qualitative (Maher is unpersuaded by arguments that the standard 
account of qualitative confirmation 1s adequate). Thus it makes no attempt to ad- 
dress the quantitative sues with which we have been concerned. Also, Maher's ac- 
count makes use of other factors which go beyond an agent's probability 
distribution. 

Maher's analysis relatvizes confirmaton to (contextually determined) “back- 
ground evidence,” which 1s taken to be so complete that evidence E confirms H reb 
atve to that background only if E is positively relevant to H for every probability 
function that could be ranonal for an agent whose total evidence consisted of that 

und. The assumpton that the relevant background can be contextually 
determined in this way makes Maher’s account objective, in the sense that it is not 
relativized to an agent's probabilities at all But this assumption would be rejected 
by those who see confirmation as depending on past learning which 1s encoded in 
an agent's degrees of belief in a way that does not allow one to isolate out the evt 
dence on which those degrees of belief were based. Maher’s account also depends 
on an unanalyzed determination of whether some propositions are “based on an 
inference from” others. To my mind, this last aspect of the account especially 
raises troubling questions, since it seems to be presupposing a notion that is fairly 
close to (and may itself depend on) the notion being analyzed. But more detailed 
discussion of Maher’s account must await another occasion. 
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To test this diagnosis, it would be natural to construct a situa- 
tion which is similar to the deer case, but in which the probabil- 
ity of His high based on background alone. In such a situation, 
it should seem more reasonable to say that A is not strong evi- 
dence for H. But it is a bit hard to do this, for the following rea- 
son: what counts as background in a given context will depend 
on the hypothesis under discussion. If we are discussing H, then 
what intuitively counts as evidence will naturally include what- 
ever supports the agent’s high degree of belief in H. But this 
makes it hard to think of a case where H is probable on the basis 
of “background” beliefs alone. The following is, however, an at- 
tempt in this direction: 

Let H be ‘There are at least five trees in the world’, and A be 
‘There are at least four trees in Jocko’s back yard’. Suppose first 
that a person has ordinary beliefs about trees, and thus a very high 
probability for H, but does not know how many trees are in Jocko’s 
yard. In such a situation, I think that it would be unnatural to say 
that A would, for that agent, be strong evidence for H. That is be- 
cause H is so probable anyway, given the agent’s background be- 
liefs, that the agent's lack of confidence in A makes little 
difference. (Here measure S gives the intuitively reasonable ver- 
dict, since pr(H/E) is essentially the same as pr(H/~E).) Were 
the agent to become highly confident of A, it would still seem un- 
natural to say that A was good evidence of H for her (and S would 
still agree with intuition). My suggestion is that our intuitions in 
this case differ from those in the deer case because H is so proba- 
ble, given the relevant “background” beliefs, that we do not see 
confidence in A as making a significant difference to confidence in 
H ceteris paribus. Measure S gets this one right (and the deer ex- 
ample wrong) because it essentially treats everything other than E 
as “background.” 

In sum, then, it turns out that measure S does possess a signifi- 
cant advantage over the traditional measure: it avoids the old- 
evidence problem (and its first cousin, the probable-hypotheses 
problem). S cannot be seen as an accurate model of confirma- 
tion, however. The reason does not ultimately stem from the par- 
ticular features with which Earman was concerned, but rather 
stems from the fact that S is insensitive to an aspect of the way we 
think about confirmation: the distinction between specific evi- 
dence and background assumptions. The inability to account for 
this feature, of course, is not peculiar to S. It will affect any mea 
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sure that tries to analyze confirmation based purely on an agent’s 
degrees of belief.™ 
IX. CONCLUSION 


Formal models of philosophically interesting concepts are tested 
largely by seeing how well they match intuitive judgments in particu- 
lar cases. Persistent failures to find a formal model that matches our 
intuitive judgments may mean that we have yet to find the technical 
notion that corresponds to our concept. But they may also indicate 
that there is some feature of the concept itself which resists formal 
capture. 

In some cases, this can simply be due to indeterminateness. Per- 
haps the controversy between difference- and ratio-based positive rel- 
evance models of quantitative confirmation reflects a natural 
indeterminateness in the basic notion of “how much” one thing sup- 
ports another. This sort of resistance to formal capture should not, 
of course, be seen as undermining the philosophical interest of the 
formal models. If the models can show us two different ways that an 
indeterminate concept can be made precise, then they have taught 
us something. 

The problems presented by old evidence (and by probable hy- 
potheses) are not, however, of this sort. Both problems threaten to 
undermine the usefulness of the traditional model of quantitative 
confirmation (as well as its ratio-based variants). And old evidence 
does the same for confirmation models based on likelihood ratios. 
In this respect, the fact that measure S avoids those problems const- 
tutes an important advantage. 

But even measure S, as we have seen, does not succeed in match- 
ing Our intuitive quantitative confirmation judgments. And the rea- 
sons for the mismatches suggest that no purely probabilistic account 
_could match those judgments. The question then arises whether the 


” The examples considered in the text make clear that we could not avoid this 
problem even if we enriched our representation of a probability function in a way 
that would allow us to recover a “pure” mittal probability function and each bit of 
evidence that had combined with that ininal function to produce the agent's pres- 
ent function (see Brian Skyrms, “Three Ways to Give a Probability Asugnment a 
Memory,” in Farman, ed., Testing Scientific Theones, pp. 157-61). For the back- 
ground beliefs which underlie the probabibstic connecnhons between evidence and 
hypothems—for example, that deer shed antlers and drop rd aia them- 
selves based on evidence. Using the “pure” function would dissolve these connec- 
tions. On the other hand, using a function that resulted from combining the 
initial function with all the evidence except the particular item in question would 
result in seeing no significant confirmation in cases like the one involving both 
antlers and droppings. 
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fact that S avoids even an important problem like the old-evidence 
problem can have any philosophical importance. 

It seems to me that it can. Even models that fail to capture some 
important features of ordinary conceptions may be quite useful in 
helping us understand those conceptions, and the phenomena to 
which those conceptions answer, getting the cases nght is not the 
sole point of a philosophical model. To the extent that a formal 
model helps one to isolate, identify, and understand those aspects of 
confirmation which resist formal capture, the model renders impor- 
tant philosophical service. 

Sis best seen not as a correct account of confirmation, but as a 
useful tool for understanding confirmation. It models one of the 
central aspects of confirmation—the one involving E making H more 
probable. Moreover, it does this while avoiding some problems that 
limit the usefulness of the traditional probabilistic measure (and the 
other popular measures we have considered). Its avoidance of these 
problems has helped us see, by examining the very cases where S 
gives unintuitive verdicts, other aspects of our conception. It has 
long been clear that the traditional probabilistic account of confir- 
mation would fail to capture the epistemic asymmetry of our ordt 
nary notion. It now seems that probabilistic accounts will also miss 
our ordinary notion’s dependence on the distinction between back- 
ground beliefs and specific evidence. 

I would not claim that these are the only aspects of the intuitive 
conception that the measure fails to match. But we should not 
hastily take further mismatches as vitiating the philosophical interest 
of probabilistic models of quantitative confirmation. For in examin- 
ing the cases in which our formal model gives unintuitive results, 
and putting our fingers on the reasons for the mismatches, we may 
yet reveal to ourselves more clearly the contours of confirmation. 
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NOTHING, SOMETHING, INFINITY * 


L a memorable passage, Bertrand Russell’ instructs us that 


There is an argument, suggested by a passage in Plato’s Parmemdes, to 
the effect that, if there is such a number as 1, then 1 has being; but 1 1s 
not identical with being, and therefore 1 and being are two, and there- 
fore there is such a number as 2, and 2 together with 1 and being gives 
us a class of three terms, and so on. This argument is fallacious (ibid , p. 
138). 


In spite of its alleged fallaciousness, I shall try to justify a form of 
the Parmenides argument. Specifically, I am after a proof of the con- 
ditional: 


(Inf) If there exists at least one thing in the world, there exist in the 
world infinitely many things 


Read contrapositively, (Inf) asserts that, if the world does not have 
infinitely many things in it, there is nothing in it We are left with a 
stark choice: there cannot be in the world only a mere something; it 
is either infinity or nothing. 

L THE ONE-REALM PRINCIPLE 
The formulation of (Inf) speaks of ‘the world’ and ‘things’ in it. 
This calls for some discussion of what I intend by these phrases. 

Traditional metaphysical analyses distinguish, among beings, be- 
tween reals and tdeals. Reals (for example, orcas and diamonds) be- 
long in the material world, often alluded to as the first realm. Ideals 
belong elsewhere, in “other realms.” How many other such realms? 


* In the last few years, I published a trio of interrelated pa “Logic and the 
World,” Journal of Paslosophacal Logic, xvi (1989): 197-220, “The What and the 
How,” this JOURNAL, LXXXVII, 5 (May 1991): 225-44, and “The What and the How 
IL” Noûs, xxx,4 (1996): 413-38. The trio investigated the kinds of things there are 
in the world and which, if any, the world must have The present paper 1s a sequel 
applying itself to the question of just how many things there must be ın the world. 

The paper has been delivered in the last four years at vanous institutions. I am 
grateful for these discussions. Indrvidually, I owe thanks to Torin Alter, Philip 
Bricker, ‘Tyler Burge, Edmund Gettler, John Heintz, Ignacio Jane, David Kaplan, 
Ah Karmi, Calvin Normore, Seana Shiffnn, Sten Lindstrom, and Byeong-uk Yi. 
Special thanks for objections discussed in section IV are due to Guy Ro h, 
Andrew Hsu, Tony Martin, and Dominik Sklenar (on whose work on kinds, see 
low). 

' Introduction to Mathematical Philosophy (London: Unwin, 1919). By ' term, Rus- 
sell does not mean a linguistic item but an object (entity, thing). 
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Some (for example, George Berkeley and David Hume) opt for an 
exclusively mental (“psychological”) realm, often referred to now 
days as the second realm. Yet others (for example, Gottlob Frege) 
speak of an additional third realm of nonpsychological, reputedly ob- 
jective, abstracta, such as thoughts (Sinne), functions, and “logical 
objects” (for example, numbers). 

It is against the background of realm proliferation that we are to 
read our one ur-pninciple: 


(One Realm) There is only one realm of being, the real world. 


Our ur-principle propounds a unification of all beings in one re- 
ceptacle-—the real universe—generated by the “Big Bang” a few bil 
lion years ago. Whatever is—be it material, mental, or abstract—is a 
real historical being, one that came to be and existed in this one cos- 
mic realm." 

One realm, then. The proof of (Inf) is about it: if there is in the 
real universe at least one thing (for example, Nastassja Kinski), then 
there are in it infinitely many real things. But from Aristotle to 
Thomas Aquinas, from G. W. Leibniz to Immanuel Kant, and all the 
way to our logically sophisticated end of Millennium, it has been re- 
peated that the concrete (sometimes: “the sensible”) pluralities of 
the world are merely finite or, at any rate, not provably infinite. As 
long as our focus is pluralities of concreta (actresses, orca whales, or 
diamond stones), there is no proving their infinitude. 

Let this much be granted: (Inf,), “There are infinitely many con- 
crete things’, is unprovable. But what about our project, proving 
“There are infinitely many real things’ (Inf,)? It is presumed that 
the unprovability of (Inf,) transfers to that of (Inf,). What lies be- 
hind the exclusion of a category of nonconcrete (“abstract”) worldly 
things? 

I.1. The worldly abstracta dilemma. For our paradigm of concrete 
subjects, consider a mischievous Puget Sound orca, Godiva, and the 
resplendent diamond on Liz's ring, Shine. Each such concrete ob- 
ject is both (i) objective and (ii) worldly (wirklich). Call the conjunc- 


* A more canonical gioes of the adjectrve ‘worldly’ (‘real’) shows up later in the 
paper (see ‘the reality quintet’). At this point, I follow this ‘do-not-definedtaseat’ 
procedure because I find the appeal of the nodon—worldly, real—independent of 
my own specific annotations. For example, Frege's gloss of ‘real’ (‘werkhch’) suits 
us very well. “The world of the real is a world in which this acts on that, changes it 
and again experiences reactions itself and 1s changed by them All this is a process 
m ume. We will hardly recognize what is nmeless and unchangeable as real"— 
fen Thought,” in P. F. Strawson, ed., Philosophical Logic (New York: Oxford, 1967) 
p. 37. 
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tion of traits the mundane conjunction, with both meanings of ‘mun- 
dane’ in mind: the in-the-world and the run-of-the-mill character of 
the conjunction of traits. But mundane as the combination is for 
Shine and Godiva, it appears intrinsically impossible for the ab- 
stracta manifested by this pair. 

With these abstracta, it is either/or. Make it (the number Two, 
the species Orca, the substance Diamond, the colors Black and Blue) 
objective, as Platonists have, and it is banished away to another, oth- 
erworldly and acosmic, third realm. Alternatively, we may follow the 
nominalist and (like, for example, John Locke) make each such ab- 
stractum into an individual mental abstraction. Less “individualisti- 
cally,” we may view such abstracta as essentially dependent on our 
linguistic use of the classificatory predicates *...are two’ and ‘...is an 
orca’ and ‘...is blue’. Either way, we have brought the abstractum 
back into the world—the psycholinguistic second realm—but at the 
cost of losing its objectivity. 

This then is our problem: the abstractum (say, the species Orca) 
cannot co-exist in the first cosmic realm with the concreta (Godiva, 
Gieronimo, and the like) that realize it, make it real. The problem is 
metaphysical and is not to be confused with Paul Benacerraf’s 
dilemma, which is essentially epistemic. Our problem does not con- 
cern sensory access and forms of knowledge; it arises for an omni- 
scient being like God: even He cannot make abstract objects bear 
the mundane conjunction, be both objective and in the cosmos. 
Even for Him it is either/or, what I shall call the worldly abstracta 
dilemma. 

I.2. The worldly infinsiy dilemma. The worldly abstracta dilemma is 
our root problem. It leads to a derivative quandary: no plurality of 
things may be both of worldly constituents and yet contain infinitely 
many items. It is again either/or. The choice we are forced to make 
may be impressed on us by considering two classical arguments for 
(Inf), one due to Frege, the other to Richard Dedekind. 

Frege’s argument first. He begins with an ideal, off-world item, 
the predicative concept (Frege speaking here) ‘not identical to 
itself’, fy. We now assign Ñ a yet higher (second) level predicative 
concept, a number concept, ZERO. Sull on logical grounds, we now 
“take” the extension of ZERO. This is by Frege’s lights a pure “logt 
cal object,” the number 0. We existentially generalize: there exists at 
least one object and 0 is it. Frege now ascends back to the realm of 
predicates and considers this time the concept F, ‘be identical to 0’. 
Correlated with it is a number concept (namely, ONE) the number 
of things falling under A. The extension of ONE is a new object, the 
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number 1. By now, Frege can prove the existence of two logical ob- 
jects, 0 and 1. And so it goes: the process yields infinitely many items 
when Frege proves by induction on n that the extension of ‘number 
of items smaller or equal to 7’ is always n+1. 

What has Frege proved? The infinitude of what he calls logical ob- 
jects, predicative concept derivatives, items that have it built into 
their very identity that they are extensions of concepts. Therein lies 
Frege’s version of our dilemma: insist on wirklich subjects, and no 
proof of (Inf) is forthcoming; shift to off-world predicative abstracta, 
and a proof becomes available. 

A similar message is imparted by the main other classical proof I 
am familiar with, Dedekind’s. In section 66 of his Was Sind und Was 
Sollen die Zahlen?’ Dedekind tries to prove that there exists at least 
one infinite “system” of objects. On his way, he argues that there is 
an infinite plurality of objects. 

Dedekind locates the infinite plurality in his Gedankenwelt. His ur 
object is his Ego. He then goes on to apply to this realm of thought 
a one-one map fthat is not onto. The informal gloss of fis ‘can be 
the object of my thought’. We are told without further ado that fis 
one-one. And obviously it is not onto: Ego is not got as a result of 
the map. This much makes the realm of these Gedanken-objects 
Dedekind-infinite, hence infinite simpliciter. But there remains the 
question: What precisely is proved by Dedekind? 

Russell's commentary says it all: 

We are then to suppose that starting say with Socrates, there is the idea 

of Socrates, and so on ad inf. Now it is plain that this is not the case in 

the sense that all these ideas have actual empirical existence in people's 
minds. Beyond the third or fourth stage they become mythical. If the 
argument is to be upheld, the “ideas” intended must be Platonic ideas 

laid up in heaven, for certainly they are not on earth (op. at, p. 189). 


The worldly infinity dilemma, all over again. 
IL DEFUSING THE WORLDLY ABSTRACTA DILEMMA * 

Consider again the map used in Dedekind’s proof. Instead of his 
original ‘can be the object of my thought’, substitute ‘set of’. This 
operation is one-one: if x+y, then {x}#{¥]}; and it is not onto; what- 
ever it exactly is, ‘meine eigenes Ich’ is surely not a set. We thus 
have on hand a (Dedekind)-infinite plurality of real items. Was the 
above allusion to dilemmas much ado about nothing? 

I shall not rest my case on the category of sets. Granted, sets, as 
characterized by Dedekind, Georg Cantor, and eventually Ernest 


* Braunschweig Viewey, 1960, eighth edition. 
* Tam grateful in this section to Andrew Hsu and Guy Rohrbaugh. 
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Zermelo, are not mere predicative projections, like Frege’s exten- 
sions or Russell's propositional functions. Nonetheless, sets are not 
what I here target by ‘real things’. It is thus time to set out in detail 
what I intend by real (worldly, cosmological, wirklich) items, why sets 
will not do and which abstracta will. 

Five marks of the real suggest themselves, what I shall call hence- 


forth the reality quintet: 


(1) No prefabricated individuation 

(2) No reduced identity 

(3) Historicity I: constitutional change 

(4) Historicity I: predicative change 

(5) Histoncity MI: counterfactual developments 


Consider first the principle of no prefabricated individuation cri 
teria. With real items (for example, Godiva), it is the actual exis- 
tence of the entity, its coming into history and development in it that 
fixes whatever is distinctive about it. No set of antecedent condi- 
tions—prior to the historical item—may define or dictate what it is 
to be this specific orca whale Godiva. 

A very different reading of individuation principles applies to sets 
and Goodmanian sums. Here, we do have axioms of extensionality 
that regulate in advance what it is to be this specific construction (the 
set Xor the sum Y) in terms of members or parts. The fact that such 
a principle is formulable for sets and G-sums (as for other “construc- 
tions” like aggregates, tropes, and the like) is, for me, an indication 
that we do not have on hand a genuinely historical subject, pro- 
duced by the autonomous forces of cosmic evolution. 

Let us now turn to a third category of items: for example, the 
species Orca and the Manchester United soccer team. Do we get the 
profile of Shine and Godiva, or that of sets and G-sums? The species 
and the soccer team are historically real phenomena: it is only their 
actual emergence in history that fixed the distinct identity of the 
item that emerged. We date the emergence, namely, the origination 
of the species and the found:ng of the team: the species did not exist 
five billion years ago and the team did not exist in René Descartes’s 
time. No prefabricated set of criteria could legislate into existence 
this species or this soccer team. 

The second mark—no reduced identity—is related. A set (Good- 
manian sum, aggregate of parts, trope) just is (is identical to) an en- 
semble of “atoms” making it up. As such, the identity of the “higher” 
entity is reducible to a constructional operation of “composition” 
(set of, aggregate of, and so on) on more basic ingredients. In con- 
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trast, Godiva and Shine are not so reducible to “constructions” out 
of ingredients. We find the species and the soccer team sharing this 
profile. Even though both the species and the soccer team have a 
multiplicity of material “members,” neither is identical to a logical 
construction out of the membership. 

This logical point regarding identity is manifested over the tempo- 
ral dimension, when we consider the manner in which the pertinent 
entities endure in history. The set and G-sum would not exist, if it 
were not for their “ingredients” (for example, {Godsva, Shine} did not 
exist before planet earth did). But once in history, the continuing 
existence of the assembled entity is tied intrinsically to this specific 
constitution; there is no way for this set or G-sum to persist except 
through the persistence of this specific constitution. 

Not so for Godiva and Shine. Their survival through history de- 
pends on constitutional change: the higher entity goes on existing, 
as its constituting matter changes, only because it is realized in a dif- 
ferent material basis. Again, the species and the soccer team follow 
this mould: the evolution in time of this single species and soccer 
team presupposes changes in membership; it is only because the 
higher entity can sustain itself by having other members that i en- 
dures. 

Constitutional change leads us to another form of change: pred- 
icative change. Real subjects like Godiva and Shine constantly 
change the properties they bear. Such is also the profile of the 
species and the soccer team: each ages, evolves, scatters, becomes en- 
dangered—all properties borne by the higher subject and not re- 
ducible to properties borne individually by its present members. 
Not so with constructions. Doubtless, the set-theoretic pair may 
change: it had a member (Godiva) that was hungry and now it has a 
member that is bellyful. But such predicative changes are essentially 
by proxy, analyzable via changes in the intrinsic ingredients. 

The availability of an actual history leads us to a final characteristic 
mark of matters real—the potential for alternative developments of 
that history. Such counterfactual developments are surely envisage- 
able for Godiva and Shine and, in turn, for the species and the team. 
No such counterfactual developments are open to abstracta of the 
constructional kind. Just as they lack a genuine actual history, they 
are not subject to alternative evolutions of that history. 

Our reality quintet sets apart “blueprint”—constructed—abstracta 
from those which are historically real. Two observations are suggested 
by this separation. First, we have seen examples (the species Orca and 
the Manchester United soccer team) of genuine worldly abstracta, 
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bearers of the mundane conjunction; each is both objective and cos- 
mic. Our root dilemma thus turns to be a false one. Second, regard- 
ing the worldly infinity dilemma, we know what to look for: items 
which are in the world like soccer teams and species but of which 
there are infinitely many. 
IIL INFINITY IN THE WORLD 

What is proved below is a form of the Parmenides argument.’ We are 
given the existence of one (purportedly the first) concrete item: the 
orca Godiva.’ By its very primal existence, Godiva generates a variety 
of real kinds: the color-kind ‘Black’, the shape-kind ‘Oval’, the bio- 
logical species Orca‘, the kind of bodily-organ ‘Brain’. One further 
kind generated by Godiva is the arithmetical kind ‘One’. She is most 
definitely a black thing, an orca-thing, and so on, but she is also one 
thing; at that, the first among all ones. 

So now we have on hand an ordered twosome: Godiva and an 
item she generated, ‘One*. Both Godiva and ‘One’ exist—two real 
things. Thus, the first real instance of *Two' exists; in turn, *T'wot is 
generated. We now have on hand three ordered, real things: Go- 
diva, ‘One’", and *Two*. The kind ‘Three’ is generated. 

The foregoing generation of the finite ordinals produces a plural- 
ity of the following form: Godiva, ‘One’, *T'wo*,... The plurality has 
w-many members. This generates the first infinite ordinal, the kind 
twt Once it exists, we have on hand the plurality: ‘One’, ‘Two'%...., 
‘wo. Here lies the generative basis for the ordinal w+1. Past the gen- 


* The idea of “proving” any variant of (Inf) may seem passé to many a reader; we 
should rather state it as an “axiom” of infinity. Let me clanfy what is here meant 

First, I am not out to prove that there exists at least one infinite system (set, 
proper class), and so on. Second, neither Dedekind nor the present account aims 
to prove (Inf) nade a formal (be it generalized first- or second-order) logic deduc- 
tive system. It is a familiar fact that such formal logic systems (where (Inf) is ex- 
presaible) afford models with finite domains. 

What is offered here is an informal proof that (Inf)}—at that (Inf, )—is actually 
true. This much is not more ambitious than what is pursued by standard settheo- 
retc axlomatizations. Consider such an axiomatiration, say, standard ZF. The au- 
thor of the axiomatization (textbook) explains to the student why the axioms are 
true. This much applies also to the axioms of pairing, union, and so on. Such an ex- 
planation must also be given for (])—-there exists an infinite set. After all, set theo- 
ries exphcitly denying (I) are available, So, do we or do we not give a “proof” af (I)? 

The answer is that most careful authors back up their assertion of (I) with the dis- 
play of an infinite set—usually the one representing the natural numbers m the Zer- 


melo or von Neumann systems. This is the textbook de facto Indeed, such 
seck a more ambitious result than ours. For it is that the aforemen 
tioned infinite phirahty of finite sets forms m turn a single set. Our argument 


does not take the last step (though IJ do not believe there is anything wrong with it). 
* In what follows, I use the notation ‘Orca‘ to make clear I speak of the entity, 
the zoological kind of Orcas. 
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eration of ‘One’, we use two further generative principles—the very 
two used by Cantor (before he gave up on ordinals as primitive and 
reduced them to sets): the addition of one item to a given ordinal 
plurality and the “taking of a limit” of an unending plurality of ordi- 
nals, as in the case of ‘w’. 

II.1. Annotations to the argument. (1) Generation versus instantiation. 
We should separate the item(s) that bring a kind into being from 
item(s) that, once the kind exists, count as its instances. Suppose 
Adam is the first being in the world. His existence generates that of 
‘One’. Later, Eve shows up. She counts as an instance of One’, not 
as its generator. And so it goes. When Adam generates ‘One’, he 
and ‘One’, the two things that they are, generate *Two*. Later, Eve 
shows up. Adam and Eve, the two-some plurality they make, provide 
an instance of “Two'. Sull, they are not the generators of *Two', a 
role reserved for Adam and ‘Onet. 

On the present account, any msome plurality, whether of num- 
bers, cabbages, or kings, may serve as an instantiation of ‘w. But 
only the preceding sequence of ordinal numbers, up to ‘r, can gener- 
ate ‘r. In all, it is suggested both that (i) only a non-number, a 
prenumber concrete object, can generate ‘Onet and (ii) from then 
on, only numbers can generate thetr successors.’ 

(ii) Ordinals versus cardinals. The procedure described above gen- 
erates first the (finite) ordinals. The ordinal is the prior existent. 
The equivalence class that it is, cardinal Card(*Two') rests on the 
subsequent truth of the predication: ‘plurality having as many items 
as the one that generated this existent, the ordinal (*Two")’. 

In this respect, Card (*T wo") is rather like the biological kind-pred- 
icate ‘is an orca’ which I read as ‘is an instance of the kind ‘Orca’’. 
In turn, the objectual ordinal kind ‘Two* is the analog of the objec- 
tual kind ‘Orca‘. In both cases, the kind-as-object precedes the pred- 
icate ‘being a member of that kind’.* 


’ On both (i) and (ii), see more in the annotations that follow. 

* In the context of infinite numbers, the difference between generanon and in- 
stantiation, ordinals and cardinals, ıs quite clear. The ordinal ‘w’ us generated first, 
then ‘w+1* By quite a different existence pnnciple, ‘w gives rise to the cardinal 
number R Now, the instances of w and ‘wt+1’ are quite different all pluralines 
of the order type w are instances of the former, one more item in the plurality or 
an order change will be required to make a plurality that mstantiates the second 
ordinal kind. But now when it comes to instances of the first infinite cardinal, all 
such instances, regardless of their order type, would do as instances. 

The ordmal/cardinal distinction shows up clearly for infinite numbers; it also 
shows up with Zero. On the present account, Zero is not an ordinal. What “de- 
fines” Zero into existence is predication: it is the number of things falling under 
empty predicates. With, for example, Card (‘Two’), we test for one-one correspon- 
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(iti) The reality quintet. (wi. 1) Identity criteria. First on our list of marks 
was the subject's affording no prefabricated ndsviduation. Classical con- 
ceptions of number supply such individuation criteria in, for exam- 
ple, the Frege-Russell vein: the number of Fs and the number of Gs 
are one and the same if and only if the classes (concepts) of Fs and 
Gs are in one-one correspondence (lately known, under the influ- 
ence of the late George Boolos, as “Hume’s principle” and universally 
viewed as “logical,” or at the very least “analytic” and “defining”). 

On the present account, it may turn out of such a (biological, 
chemical, arithmetic, and the like) kind that some fundamental fea- 
ture of its generation process is uniquely true of it; thus, different 
persons (rivers, species, chemical elements, numbers), different gen- 
erative histories. But these are ex post facto histoncal facts, not pre- 
fabricated regulative conditions. It may thus well turn out that 
different ordinal numbers have different generative histories (differ- 
ent numbers, different ancestors) but this much is simply a descrip- 
tion of how each came to be, not a stipulated convention. 

(111.2) No reduced identity. The reduction would amount to the 
number’s being just a certain ensemble of more basic ingredients 
(parts, sets, concepts). Our number-kinds are not reducible to their 
members. The existence of members is critical for the generation 
and ongoing existence of the kind but the kind itself is never ex- 
hausted by this or that membership. 

(2.3) Historicity I: constitution change. Consider the difference be- 
tween our kind ‘Two' and, for example, von Neumann’s set ‘Two'= 
(Godiva, One} (s-quotes are to remind us the object is a set). First, 
regarding identity: von Neumann’s ‘Two’ is a specific two-membered 
set. Benacerraf’s other problem—“What numbers could not be”— 
strikes: Why is the number Two identified with this set and not any 
other two-membered set? In contrast, our kind ‘Two! is at the right 
level of generality: it comprehends all two-item pluralities as in- 
stances (the instances of the kind are two-some pluralities (for exam- 


dence with the wr-two-some plurality that generated the ordinal ‘Two’. In the 
case of Zero, we cannot so test, for there easts no such ur-exemplar and no gener- 
ated ordinal. All there is is the one - one correspondence between all empty predi 
cates. We may feel here, with Frege, that Zero is an adjective that has grown 
capitals (for example, ‘there are no(zero) unicorns’). 

The foregoing relates to a referee's question whether we could use our real- 
number-kinds (for example, ‘Two*) to number third-realm items (for exam- 
ple, two forms in Plato’s heaven or two Fregean extensions) The answer 
tracks back to my above (one realm) principle: all beings exist in this one real 
cosmological realm. What is not—is not wirklich —cannot be numbered. 
Only reals can enter real relations—be numbered by—with a real like the num- 
ber ‘Two. On which, see more in annotation (11.4). 
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ple, me and you), not the subsequent unity (the two-membered set 
(me, you})). 

Second, regarding existence. von Neumann's ‘Two’ is perilously 
dependent on the existence of Godiva: no Godiva, no *T'wo’. In con- 
trast, the kind ‘Two, like ‘Orca’, goes on existing while its members 
change. The number-kinds (again, like biological ones) were gener- 
ated by a specific ur-item—the historically frst concretum. But to go 
on existing, all they need is to be instantiated by some—any old— 
instances.’ 

(iii. 4) Historictty IT: predicative change. *Orca* and ‘Diamond’ change 
through history. But do numbers ever change? At first blush, the 
difference between the cases seems striking. But let us have a closer 
look. Granted, 9 is fixedly odd and greater than 7. But this fixidity, 
just as with the temporally stable features of *Orca*, concerns fea- 
tures fixed by (and at the time of) the very generation of the item. 
On the other hand, 9 also numbers the coins in my pocket. This is a 
feature gained by 9 after its generation and thus subject to change. 
When we say the next day ‘9 used to number the coins in my pocket 
but it no longer does’, we make a claim of de re change. some specific 
res was ONE Way, Now it is not. The question is: What res is that? 

The natural answer seems to me: the number of the coins. Of 
course, at first blush, we think the change is, to speak loosely, “in the 
coins,” not in the number. But what does it mean the change is “in 
the coins’? It could not be a de re change in the nine membered set 
{cl, c2, ..., c9}. This set cannot survive the addition or subtraction of 
acoin. Nor is it a change in the aggregate or the Goodmanian sum. 
I submit that the change in the coins induces a change in the num- 
ber but it is not identical to it—the change in the number may have 
taken place in other ways (instead of adding cl0, I may have added 


* What if we added to the reality quintet a sixth condition: all reals must at one 
time or another cease to be? Would this not make the existence of an infinity of 
our kind of numbers ephemeral? 

My answer is twofold. First, quite apart of numbers (and other real abstracta), I 
would not add the purported sixth mark of being real. Generanon—as a mark of 
the real—is not “symmetric” with corrupton. Many generated abstracta (includ- 
ing individuals, not just kinds) may go on existing by replenishing the materials 
that realize them. This gives such real subjects—as opposed to sets of such mater- 
lal ingredients-—a long lease on life. 

Second, regarding numbers. Once there was one concrete thing, there were 
numbers, And if-—per tmpassiile—-there were nothing concrete, there would be 
nothing at all. As will become clear in the next annotaton, I believe that once 
there existed something concrete, there always (and of necessity) continued to be 
some concrete item In time, the infinite generating sequence, with the specific Go- 
diva at its basis, will be gone, because Godiva will corrupt. But the numbers (the 
second place in this sequence, ‘One’, the third, *Two', ) are forever. 
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cll). Likewise, there may have been, in the concrete coins, the fob 
lowing change: the addition of one blue coin ¢l0 (the original nine 
are red). But it would be a mistake to identify the change thus in- 
duced in the number of coins with the change in the color. Although 
actually realized via the addition of cl0, the change in number and 
color might have occurred se 

One may question the foregoing emphasis on the intuition of de 
re change. Thus runs what I call the no-change objection: 


Consider “The president used to be George Bush and now it is Bill Clin- 
ton.” Rather than a change in the state of a single underlying entity, we 
have replacement of one entity by another: Bush was the president and 
another individual, Clinton, is now the president. In a similar vein, we 
may suggest for the above claims that no number, color, and so on has 
really changed. We have instead replacement: one color, green, was the 
leaves’ in the spring, now it is another, yellow; one number, 9, was re- 
placed by another 10, as the number of coins in my pocket.” 


In response to the no-change objection, I shall say this. First, we should 
separate in this discussion syntactic and metaphysical issues. Grammat- 
ically, we have on hand a uniformity between ‘Bert’s girlfriend (the U. 
S. president) is changing’, “The color(number) of the coins is chang- 
ing’. Metaphysically, going by the present paper’s outlook, species, 
temperatures, colors, and numbers are real kinds, coming in and out 
of cosmic history independently of our verbal activities. In contrast, the 
“roles” of being the American president and Bert's girlfriend are in 
duced by predications of ordinary individuals. I shall attend here to real 
kinds and leave to the aforementioned linguistic companion piece the 
proper analysis of ‘Bert’s girlfriend (the president) is changing’. 

Second, let us ask now: When I speak of “the number (color) of 
coins is changing” as involving de re change, in which res is the 
change to be located? 


4“ The nochange objection has been made to me by a variety of commentators In 
the University of California/Los Angeles Language Workshop dedicated to the 
Partee-Montague puzzle regarding the sentence "The temperature of LA is ninety 
and it is nsing’. It was also made by Francois Recanan. 

I view Montague’s o analysis of "The temperature is rising’ as in the no- 
change objection vein. He lets the prime semantic value of ‘the temperature’ be 
a function from times to numbers. He then speaks of that function as “chang 
ing” (“rimng”). In truth, the function does not change (rise) at all. It ıs as fixed 
as any thing could be, the set of ordered ttme-number pairs that it is. What is ac- 
tually encoded by the function 1s replacement: the replacement of one extension 
(90 degrees) by another (91 degrees) as the extension of ‘the temperature’. I 
discuss in detail Montague’s analysis, the nochange objection, and the syntax-se- 
mantics of such constructions in my “The Subject-intransittve Verb Clas” (in 
press). 
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The question cannot be answered without understanding the 
prechange kind of fact reported in the likes of ‘The species of Godiva 
is orca’ or ‘The color of leaves is green’. Some (for example, 
Richard Montague) regard the claims as having the form of identity 
statements. With this, I disagree. We have identity claims in the 
likes of ‘Furze is Gorse’ (species identity), the bilingual “Green is 
Verde’ (color identity), ‘34=X XXIV’ (number identity). 

In contrast to an identity, ‘The species of Godiva is Orca’ has the 
two-sorted relational form: 


R (6A) 


where c indicates the concrete relatum, A the abstract relatum. 
Idem for “The number of coins is 9’, whose form I view as the rela- 
tional: Has-number (coins in my pocket, 9). 

I read much importance into the genitive form of our construc- 
tions, namely, the abstractum mentioned is of-the-concretum (a): its 
color, temperature, species, or, in the plural, their number. Gener- 
alizing, I would like to call such assignments of a concretum-to-a 
kind kind-membership reports. 

Now, when we report change in such membership relations, we re- 
port change in a relation. The primary change is in the concrete rela- 
tum. But its (their) failure to stay related—on a given dimension 
(color, temperature, number, and so on)—to the kind (red, 90 de- 
grees Fahrenheit, 9, and so on) induces a change in this second rela- 
tum—the kind. Here, the change regards what the color, 
temperature, and number are of.” 

(1.5) Histonaty II: counterfactual developments. Nine actually num- 
bers the coins in my pocket but it might have failed to number them, 
just as Nastassja Kinski was actually the leading actress of Tess but she 
might have failed to be if she resisted Roman Polanski’s pressures. 
What makes these ‘might have’ claims true? 

‘x might have failed to F’ is true at a moment tin the actual history 
of the world if and only if there is in the actual history of x an earlier 
moment ¢* at which x could have gone on to be not F There are 
such moments—pre-1976 for Nastassja and prior to my filling my 
pocket with these nine coins for the parking meter. For concrete 
subjects like her, the necessity of bearing a trait would amount to 


* Jt has been sometimes objected to me that this is a mere “Cambridge change” 
in the species, color, and so on. Not on the present view of kinds. We saw earher 
that, for example, species identity and existence depend on (i) wat animals the 
species was generated by, and on (1i) thers beng such animals to generate it In con- 
trast, when I Cambridge-change because you had a hamburger for dinner, nothing 
relating to my very existence and identity depends on your feeding behavior. 
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there being no way her history could have gone otherwise. And in the 
same vein, this is our grounding of arithmetic necessities: although 9 
might not have numbered the coins in my pocket, it could not have 
failed—at any point in its history—to be odd or greater than 7. Not 
only do we unify Nastassja and Nine in the same cosmic realm of exis- 
tents; we also apply a unitary notion of de re necesstty—cosmic neces 
sity—to both Nastassja's being human and 9’s being odd. 

(tu) The necessary existence of infinitely many things. The idea of cos- 
mological necessity assumes through and through that there is only 
one world—the real world. For a trait F to be necessary of the real 
world, we look for all the possible developments of the world past 
the moment of its actual origination. Now, looking at Nastassja, we 
do not consider among her alternative developments, a consistent 
story about some individual looking like Nastassja and having the 
same genetic endowment as she does but living, to begin with, in 
Descartes’s time. In the same vein, we do not consider among the al- 
ternative developments of the history of this cosmos, consistent sto- 
ries about some universe that would look (by now) like ours. A story 
that makes the world have a radically different history to begin with 
(for example, a Big Bang 500 billion years ago, a substantially differ- 
ent rate of expansion, or, following Aristotelians, no beginning mo- 
ment at all) is just that—a story (“theory”). 

It is important to separate the conception of cosmic necessity from 
the more familiar “parallel worlds” conception of necessity. A sense 
of the difference in the case of numbers may be gleaned from the 
analogous and much rehearsed example of God. 

On the “parallel worlds” conception, God’s necessary existence is 
secured by his antecedent acosmic essence (definition). A plurality 
of possible worlds is given of which this historically real world is just 
one (one among many of a kind). It now turns out that in any arbi- 
trary world w, something bears the divine essence at w and that 
something exists at w; furthermore, what bears it in some world wu, 
bears it in every other world (shades of the ontological argument). 

In contrast, the cosmological necessity of God’s existence does not 
proceed from essence to transworld existence. We rather seek to es- 
tablish why, among actual existents, His is special: whereas other real 
existents may be subtracted from the history of the real world, He 
stands in a special position in the actual cosmic history and may not 
be subtracted. His necessity lies in His insubtractibility from the his- 
tory of this one and only world of ours. 

So it is for God and so it is for the numbers. On the “parallel 
worlds” conception, our Parmenidean argument secures it that in 
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any arbitrary world w, if there is one thing, there are in w infinitely 
many things. But, alas, I do not know how to argue for “parallel 
worlds” (given disjointly and point-like) that, to begin with, each 
must have something in it. 

On the cosmological conception, we start with the actual fact that 
the world itself ts. This world is not a “point” or an index of evalua- 
tion or a set of propositions; it is simply this real evolving universe. 
So, something is. But is its existence necessary? There is a point in 
the history of the world in which it could have gone on to develop 
without Nastassja Kinski, the Nanga Parbat, or planet earth. But is 
there a point tn the history of the world at which it could have gone 
to be such that it did not exist in the first place? With the necessity 
of some real thing thus secured, our Parmenidean argument carries 
us all the way to the necessity of infinitely many reals. 

(v) The generation of ‘One*. Some readers will want to grant that the 
sheer emergence of Godiva generates certain kinds (for example, 
*Orca') but not the purported kind ‘One’, simply because this last is 
not a genuine kind. There is no genuine “common nature” or “uni- 
fying essence” behind all ones. 

The present account of real kinds (for example, of biology and 
chemistry) is this. A process in history produces items of the pert 
nent kind and it is thus—and only thus—that the species and the 
substance come into history. Such is the worldly picture of ‘Orca’ 
and ‘Diamond* and such is the intended account of *One*. 

First, a concrete existent brings into the world this other, more ab- 
stract, kind of existent, the ordinal ‘One’. Eventually, all numbers 
are so generated, not “from above,” by a third or second realm, 
defining predication but “from below,” by in-history, preceding real 
existents. There is an intrinsic difference between (i) the first num- 
ber, ‘One’, which can only be generated by a non-number, and (ii) 
all remaining numbers which can be generated only by other num- 
bers, each by its predecessor. We should note that in (i)-(ii), we 
have a nice metaphysical reflection of the familiar (Peano- 
Dedekind) axioms: the tr-number (for Dedekind and for us) 1 is in- 
trinsically nota successor; all others are intrinsically successors. 

In the present framework, predecessor numbers actually precede 
the next number in the historical order of existence. Thus, the fa- 
miliar fact that 16 is “after” (succeeds) 12 is not a truth got “from 
above,” as if after 16 and 12 have been separately defined by predica- 
tion, it also turned out that a plurality with the cardinal number 16 
has “more items” than one with 12 items. When it comes to our gen- 
erated ordinals, we have here a “from below” fact about the histor+ 
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cal order of generation: 12 is among the ancestors of 16. This is 
meant not as a “metaphor” or “logical analog” of real ancestry rela- 
tions. Not at all. Without the existence of 12 in history, 16 would 
not emerge. 

One final repartee and a forceful one at that—the separate begin- 
nings objection. Take to heart the foregoing claim that ‘One’ is as 
generable as ordinary natural kinds, and it is from the frying pan 
into the fire: 


Consider the spatiotemporally separate Dobbin and Bertrand, the first 
horse-like (in looks and DNA) animals. By your ( JA’s) own lights, the 
two generate two distinct biological kinds. But you say that number- 
kinds are like biological kinds. So, either you predict that (i) the world 
must have at most one primal concrete item or else (ii) if there were 
two (or more) simultaneous primal objects (for example, Electra (E) 
and Fidelia (F)), the two would breed two distinct kinds ‘One. This is 
an odd result.” 


The source of the separate beginnings objection lies in misinterpret- 
ing the analogy between arithmetic and biological kinds. I would like 
to explain in what way they are similar and in what way they are not. 

Biological kinds (for example, 'Horses*) make prominent two sep- 
arate features: 


(Generability) For a kind to exist, it has to have concrete generators. 

(Membership connectedness). To be a (late) member of kind K (for 
example, a contemporary horse), x has to come from Ks (for ex- 
ample, from horses). 


The point to notice is that (generability) and (membership con- 
nectedness) are logically independent. I definitely asserted above 
(generability) for any real kind whatsoever. But is (membership 
connectedness) also true of kinds in general or is it something pecu- 
liar (even if essential to) biological species? 

One might be led to think that I submit (membership connected- 
ness) for all kinds simply by confusing it with (generability). But 


* The objection that follows has been made to me early on by Martin, then by 
Normore, Shiffrin, and Dominik Sklenar at UCLA and Jane in Barcelona. The dis- 
cussion of kinds that follows owes much to conversations with Sklenar about his 
1996 UCLA Ph.D. “Being of a Kind”; a published version of his interesting ideas is 
forthco : 

I should it noted that the objection has the form of a disfunction: either the 
world must have at most one primal item or else if there were more than one such, 
then.... I believe the first disjunct is true. The remarks in the previous annotation 
about the necessary existence of the world itself give a hint of the ent I 
would submit. But in what follows, I shall assume with my opponent there 
could be more than one primal item. 
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even without such a confusion, one may reason as follows regarding 
the separate question of kind-membership. 

The critical question to consider is this: What unifies two items 
x,y in a given kind K? Either the unification is “from below” or 
“from above.” If it is “from below”"—due to a cosmological fact-— 
the model must come from the unification operative in biologi- 
cal species, namely, via historical connectedness between 
members. If not, the only alternative unifier is an idea (Sinn, 
form) in another “realm” (for example, Locke's second, Frege's 
third, or Plato’s “heaven”). This either/or I shall call the untftca- 
tion dilemma. 

Driven by the dilemma, our objector may well reason thusly: the 
present approach would not allow unification by a second-or-third 
realm template. Therefore, the one intended must be unification as 
in the biological case—by historical connectedness. By hypothesis, 
there is no connectedness between Electra and Fidelia. Thus, we are 
led to two number-kinds ‘One“*. 

My answer is that the unification dilemma is a false dilemma. 
There are other “from below” ways (aside of historical connected- 
ness) of unifying things in the cosmos. A pertinent example is pro- 
vided by chemical elements, to which I now turn. 

What makes any old atom xa member of the kind ‘Germanium’? 
Let us assume that the emergence of this chemical element in the 
cosmos (so many (many) seconds after the Big Bang) is the result 
of two simultaneous but causally separate supernovas. Each super- 
nova produces from electrons and protons by a similar process an 
atom with the structure of atomic number 72. The one Germa- 
nium atom call g*, the other atom g**. In my view, both are of 
the kind “Germanium*‘. Yet surely, they are not historically con- 
nected. 

What unifies g* and g** in the kind is neither (i) historical 
connection nor (ii) satisfaction of an apriori third-(second-) 
realm template. What makes an atom a Germanium atom is the 
type of cosmic process by which it came to be: electrons/protons 
at the right temperatures (and so on) coming together to form a 
unity. 

Let me draw from the foregoing a general conclusion. What mat- 
ters for membership at the general level of all kinds is this: 


(M) what makes x and y cospecific is that the processes by which 
they each came to be are fundamentally similar (in the pert- 
nent respect: chemical, biological, and so on) and essential to 
x(y)'s very coming into being. 
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(M) is true of the historically bound tigers xy; the historically dis- 
connected germanium atoms g* and g**; and lastly, the separate si- 
multaneous primals Electra and Fidelia. 

It is essential to each and every object's (including Electra and Fi- 
delia) process of coming into being that it would come to have unity, 
that it would become one thing. The coming together of the unity 
might involve the bonding of electrons and protons, DNA mole- 
cules, stars in a galaxy, and other more complicated modes of forma- 
tion, depending on the specific kind of item concerned. But over 
and above specificities due to varying kinds, each object’s very com- 
ing into being has essentially this much in it: the coming into being 
of one thing. We might well say: no entity without numencal identity. 
Physically, Electra and Fidelia may or may not be of the same kind. 
But this much is given about the two as about any other emergent 
being: Electra and Fidelia are of a common arithmetic kind, the one 
and only ‘Onet, because in both we have the fundamentally similar 


process of a coming into being of a distinct new entity. 
IV EPILOGUE 


I have argued that there can be no existence of a mere something, 
be it a single initial something or a plurality of such primals. Each 
such primal being (as any other object) cannot exist without being 
one thing, a number that itself could not come into being without 
having a generating instance, some one concrete thing. So: no num- 
ber one without a concrete generator, but also, no such concrete 
generator without the number one. Very well then: coming into the 
world with each such primal entity is the number one; and with that 
twosome, the primal entity and one, comes into the world the num- 
ber two; and so on. We can sum it all thusly: there is no real infinity 
without real being and no real being without real infinity. 

JOSEPH ALMOG 
University of California/Los Angeles 
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PUTNAM’S MODEL-THEORETIC ARGUMENT 
RECONSTRUCTED * 


etaphysical realism has been the main subject of the critical 
Me of Hilary Putnam's work for over twenty years now. At 
the heart of this doctrine, as he presents it, is a thesis I shall 
call methodological falkinksm (MF): even an epistemically ideal theory 
may fall short of truth, where the notion of truth involved is that of 
correspondence truth (CT).’ Note that MF is not merely about epistem- 
ically ideal theories. The claim is that no empirical theory, no matter 
how good our evidence for it (that is, not even if it should be epis 
temically ideal), is guaranteed to be true. A further characteristic of 
metaphysical realism is a view of semantics sometimes called semantic 
naturalism (SN). Its credo is that “Semantics is an empirical science 
like any other.” One consequence of this view is that theories of 
reference (theories of the correspondence that, according to CT, 
has to obtain between reality and a sentence for that sentence to be 
true) are empirical theories, and are thus to be accepted or rejected 
on the basis of exactly the same criteria that govern theory choice in 
what we more traditionally think of as the natural sciences. 
Over the years, Putnam has advanced several arguments against 
metaphysical realism.’ Of these, the so-called modeHheoretic argu- 


* Tam grateful to Leon Horsten and Bas van Fraassen for helpful discussions on 
the subject matter of this paper 

' As spelled out, for instance, in Michael Devitt, Reahsm and Truth (Cambndge: 
Blackwell, 1991), p. 29 Some passages in Putnam’s writings suggest that he thinks 
MF to be a mere consequence of the realist’s view on truth—see, for instance, his 
Meaning and the Moral Saences (Boston: Routledge, 1978), p. 125. But from the fact 
that the metaphysical realist separates truth from any epistemic noton (like verf 
cation or rational acceptability), and in that sense conceives of truth as “radically 
normepistemic” (sid.), nothing follows about the (possible or necessary) truth or 
falsity of (ideal) empirical theones, at least not as long as we are not informed 
about the metaphysical realist’s view on the relanon between truth and evidence. 
That for the realist an ideally confirmed theory 1s not true by definition (as it is on 
some pragmatist conceptions of truth), and hence not necessarily true for that rea 
son, does not mean that a realist could not, on other grounds, consistently come to 
the conclusion that an ideal empincal theory must be true. As ıt happens to be, 
methodological considerations—such as the underdetermination problem—sug 
gest that this is not so, and thus motivate MF (cf Bas van Fraassen, The Saenttfic Im- 
age (New York: Oxford, 1980); John Earman, “Underdeternmination, Realism, and 
Reason,” Midwest Studses in Philosophy, xvm (1998): 19-38; and Leon Horsten and 
my “Earman on Underdetermination and Empincal Indistinguishability,” Erkennt- 
mis, IL (1998): 303-20). But that is another matter. 

* Devitt, Coming to Our Senses (New York. Cambridge, 1996), p. 48. 

’ For an overview, see Ernest Sosa, “Putnam’s Pragmatic Realism,” this JOURNAL, 
xc, 12 (December 1993). 605-26. 
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ment is the historically first—it dates back to 1976. The argument is 
claimed to be a reductio; it allegedly derives the negation of MF— 
‘Some empirical theories are necessarily true’—from the conjunction 
of MF, CT, and SN. Although its “core” is simple and uncontested, 
the modeltheoretic argument has continued to exercise philosophi 
cal minds ever since its first presentation, for it is not so simple to ad- 
Judicate how effective it is as an argument against realism. 

Much seems to hinge here on the legitimacy of a certain crucial 
move in Putnam’s argument. Already in his first presentation of the 
modeHheoretic argument, he considered, and quickly dismissed, a 
possible objection he expected his opponents to make. Commenta- 
tors have called Putnam’s dismissal—the “just more theory move,” 
as it has come to be called——“completely wrongheaded,”‘ “foul,” and 
“illicit,”* and have argued that the move is “simply begging the ques- 
tion against the realist."* I aim to show here that, at least on one 
construal of the model-theoretic argument, Putnam’s critics are 
wrong; on that construal, the “fust more theory move” is perfectly le- 
gitimate, and the argument as a whole valid and, if we lay to rest 
some (what seem to be justified) doubts concerning one of its 
premises, even sound. There is, as far as I can see, at most circum- 
stantial evidence for the claim that the construal of the model- 
theoretic argument that will be presented captures what Putnam 
“really had in mind.” I urge, however, that the question to what ex- 
tent it accords with his intentions, if at all, should not concern us too 
much, as I think that on the proposed reading, anyhow, his argu- 
ment makes much better sense than on the more common way of in- 


terpreting it. 

I THE MODEL-THEORETIC ARGUMENT 
The modeHheoretic core of Putnam’s argument is really quite sim- 
ple, and clearly correct. Let T, be an epistemically ideal theory (that 
is, a theory possessing every conceivable virtue except, perhaps, 
truth). Being ideal, it will, among other things, be consistent. It 


* James Van Cleve, “Semantic Supervenience and Referential indeterminacy,” 
this JOURNAL, LXXXIX, 7 ( July 1992): 344-61, p. 349. 

* Bob Hale and Cnspin Wngbt, “Putnam’s Model-Theoretic Argument against 
Metaphysical Realism,” in Hale and Wnght, eds., A Companion to the Philosophy of 
Language (Cambridge: Blackwell, 1997), pp. 427-57, pp. 440£. 

* Devitt, Reahsm and Truth, p. 227. The objection that Putnam’s argument ult 
mately begs the question has also been raised by (among many others) Anthony 
Brueckner, “Putnam's Model-Theoretic Argument against Metaphysical cheapie 
Analysis, XUV (1984): 134-40, Craig Hansen, “Putnam's Indeterminacy Argumen 
The Skolemization of Absolutely Everything,” Phalosopphical Studzes, LI (1987): TIO: 
and Mark Heller, “Putnam, Reference and Realism,” Msdwest Studies in Philosophy, 
xo (1988): 113-28. 
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then follows from some basic model theory that T, has models of 
every infinite cardinality. Choose from these some model M of the 
same cardinality as the world, and establish a satisfachon relation 
(SAT) by mapping M 1-1 onto the world. If T's language is then in- 
terpreted according to SAT, T, comes out true of the world, that is, 
T, is rue-on-SAT, or TRUE(SAT). 

It should be noticed that, apart from consistency, 77s idealness 
plays no role in this argument.” Hence the argument is really about 
consistent theories. The conclusion is restricted, however, by two as- 
sumptions that were tacitly made: first, that 7, has at least one inf 
nite model; and second, that the world is infinite. But if we agree to 
consider in our modal reasoning about theories only infinite worlds 
(so we do not examine what properties theories have in worlds that, 
following the assumption about the world’s size, already for cardinal 
ity considerations could not be the actual one) and if, moreover, we 
restrict our attention to theories with at least one infinite model 
(note that all “real” empirical theories are of this sort), then the fol 
lowing can be taken to be the proper conclusion of the argument 
above: 

(1) Any consistent empirical theory is TRUE(SAT) of any world on 

some satisfaction relation SAT (in fact, on indefinitely many such 
relations). 


Prima facie, this conclusion appears to be quite harmless for the 
metaphysical realist. That any empirical theory is guaranteed to be 
TRUE(SAT) for some SAT does not seem to contradict the realist 
claim MF that no empirical theory is guaranteed to be true, where, 
according to the realist, a theory is true if it is true on the “intended” 
interpretation of the language in which it is formulated. The realist 
acknowledges that there is an abundance of correspondence rela- 
tions between our vocabulary and reality, but he insists that only one 
of those is intended, only one of those is reference. So what is the 
point of the argument? 

The question is not readily answered. J reserve my own answer for 
a later section, and shall here present the common interpretation of 
how Putnam’s further moves bring the above considerations to bear 
against the realist. On this interpretation, his point is that, roughly 
put, there is no way for the metaphysical realist to discriminate be- 
tween truth-on-some-nterpretation and truth simpliciter. For, Put- 


” As was already noted in David Lewis, “Putnam’s Paradox,” Austulasan Journal of 
Sin ie Da (1984): 221-36, which also contains a hucid discussion of the question why 
Putnam himself restricts the modeHtheoretic argument to ideal theones (pp. 230£). 
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nam (on this reading) argues, from that perspective there can be but 
one distinguishing feature—call it F—between the possible inter- 
pretations of a theory’s language: some make the theory come out 
true, others make it false. So the realist has only two options in giv 
ing substance to the notion of intendedness: he can define ‘intend- 
edness’ with respect to interpretations to mean being a member of 
the class of F-interpretations (that is, of the class of interpretations 
that make the theory true) or alternatively as being a member of the 
class of non-F-interpretations. Since we certainly want our theories 
to be true, the former seems to be preferable as a definition. But 
then it indeed follows that any consistent empirical theory must be 
true, contrary to MF. (Of course, if the latter definition were cho- 
sen, it would follow that every empirical theory must be false, which 
the realist-—as, presumably, anyone else—regards to be just as ab- 
surd as the conclusion that every such theory must be true.) 
In this reasoning, two premises are crucial: 


(2) If there is not some (set of ) feature(s) F*, other than F distin- 
guishing among all the existing correspondence relations between 
the words of a theory's language and (sets of) things in the world 
the putative unique reference-determining correspondence rela 
tion, then a theory is true exactly if it is TRUE(SAT) on some corre- 
spondence relation SAT. 

(3) There is no such feature F* that complies with the metaphysical re- 
alist’s standards. 


Both in Putnam’s presentations of the modeltheoretic argument 
and in the responses to it of his realist opponents, (3) has received 
most of the attention. We shall see further on that there is an in- 
terpretation of the model-theoretic argument so that (3) is not 
among the argument’s premises. But (2), or something close to it, 
must be among the premises on any interpretation. At least it is 
hard to see how else we are to arrive at a conclusion about the 
truth simpliciter of theories from the model-theoretic point con- 
cerning the truth-on-SAT (on some SAT) of theories. Putnam 
himself says surprisingly little about this premise.® Perhaps he 
takes it to be so obvious as to need no mention or comments. | 
surmise that Putnam regards (2) to be a conceptual truth, at least 


* I discuss this premise (or rather a principle closely related to it) in greater de- 
tail in my “A Note on Global Descriptrvism and Putnam’s Model Theoretic Argu- 
ment,” Australasan journal of Philosophy (in press). See also Mary Kate McGowan, 
Reaksm or Non-Reaksm: Undeadabte in Principle, Deadable m Practice (Doctoral Disser- 
tation, Pnnceton University, 1996); and van Fraassen, “Elgin on Lewis’s Putnam’s 
Paradox,” this JOURNAL, XAY, 2 (February 1997): 85-98, p. 86, and “Putnam's Para- 
dox: Metaphysics Revamped and Evaded,” Philosophical Perspectrves, x1 (1997): 17-42. 
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from the point of view of the correspondence theorist, following 
from the conception of truth as correspondence to reality. It in- 
deed seems that, if there is no feature F* different from F, then 
TRUE(SAT) on some SAT is the only way to capture the idea of 
correspondence to reality—and under the presupposition it seems 
to capture that idea fully. One may still doubt whether the realist is 
committed to (2). But I shall here grant Putnam that (2) is a con- 
sequence of CT. Strictly speaking, then, my claim is a conditional 
one, namely, that good sense can be made of his argument pro- 
vided (2) follows from CT and is thus legitimate as a premise in an 
argument against realism. This does not weaken my case too 
much, I think, given that (2), or some thesis equivalent to it, is also 
among the premises of the argument as commonly read, and that 


realist complaints about that are rare. 
IL JUST MORE THEORY? 


(3) has stunned many of Putnam’s realist opponents. Typically, natu- 
ralistically inclined realists endorse some version of the causal theory of 
reference (CTR) according to which reference is constituted causally.’ 
In other words, they claim that there is an acceptable feature F*— 
causality—sufficient to single out reference among the welter of 
word/world relations. The realist answer to Putnam’s modeHheoretic 
argument is then obviously this: since there is no guarantee that SAT 
coincides with the causal relations constituting reference, there is no 
guarantee that T, is true (instead of just TRUE(SAT)). 

But Putnam had anticipated this response, arguing that invoking 
CTR cannot block the model-theoretic argument, since CTR is 
“just more theory.” His argument for this claim could be more de- 
tailed, but it is commonly read as follows: for CTR to be a reason- 
able candidate theory of reference in the first place, it must be 
compatible with the epistemically ideal theory T, (given the con- 
straints on 7, CTR might even be assumed to be part of that the- 
ory). So CTR can be consistently added to 7;. But now the 
model-theoretic argument, being as general as it is, can simply be 
repeated, this time for T}UCTR. Quite evidently—the argument 
on this reading continues—if we assumed that the problem the re- 
alist faces had already been solved for the part of the language in 
which CTR is couched, then this theory might plausibly be held to 
solve the problem for T, (as well as for any other theory to which 
the model-theoretic argument applies), but “how ‘causes’ can 


* See, for example, Devitt, Dengnathon (New York: Columbia, 1981) and Coming 
to Our Senses, and Clark Glymour, “Conceptual Scheming, or Confessions of a 
Metaphysical Realist,” Synthese, L1 (1982): 169-80. 
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uniquely refer is as much of a puzzle as how ‘cat’ can, on the meta- 
physical realist picture."” In other words, CTR is part of the prob- 
lem not the solution sought. To assume otherwise is to beg the 
issue. 

This “just more theory move” is the most controversial part of 
Putaam’s argument; we shall shortly discuss its legitimacy. Before 
doing so, however, we should briefly consider Putnam’s claim that 
the model-theoretic argument works only against realism because 
SN is part of that position, that is, that there is an effective response 
to the argument open to an advocate of CT and MF but explicitly 
not SN. This claim will be seen to give independent support to my 
own reconstruction of the argument as opposed to the common 
reading of it. 

The response Putnam sketches is to rely on what he calls a “magical 
theory of reference.” Such a theory at bottom comes to a denial that 
reference is anything like a natural relation, that anything informa- 
tive from a scientific perspective can be said about it.” So it could be 
claimed, for instance, to be a “surd, metaphysical truth”” that some 
one word/world relation is reference (“it just happens to be so”). Or 
some more substantive explanation of reference in terms of supernat- 
ural powers (noetic rays, intuition, direct grasping, what have you) 
might be tried. But how the details of such a magical theory of refer- 
ence are filled in does not matter; all that matters is that SN and the 
standards of acceptability it imposes on theories of reference are 
given up. 

David Lewis finds Putnam ’s claim that magical theories of reference 
can pull the trick where naturalistic ones fail puzzling. For, he thinks, 
if the “just more theory move” were legitimate, then it should work no 
matter what kind of feature F*—whether natural or non-natural—an 


opponent might appeal to. As Lewis says: 
What’s the good of holding up yet another sign, thus 


DIRECTLY GRASPS 


* Putnam, Meantng and the Moral Sciences, p. 126. See also “Models and Reality,” 
in his Reahsm and Reason (New York: Cambridge, 1983), pp. 1-25, pp. 17£., and “Is 
the Causal Structure of the Physical Itself Something Physical?” in his Reahsm with a 
Human Face (Cambridge: Harvard, 1990), pp. 80-95, pp. 85f. 

u Note that, as defined here, the “account” of reference Putnam presents in his 
Reason, Truth and History (New York. Cambridge, 1981)—according to which all 
that can be said about reference are pragmatic tautologies, such as that ‘cat’ refers 
to cats—counts as magical, too. 

2 Puram, Reason, Truth and History, p. 46. 
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or perhaps 
[runs | 


if it is still open to Putnam to challenge the determinate reference of 
the words written on the sign? ...Putnam ...has said nothing to show why 
the [‘just more theory’] treatment applies only to natural constraints 
(op. at, p. 233). 


So if reference is claimed to be singled out by some supernatural 
power, why not add ‘Reference is singled out by some supernatural 
power’, and perhaps some account of that supernatural power, to 7; 
and apply to that theory the same bit of model theory that was ap- 
plied before to T)\VCTR? The “unnaturalized” metaphysical realist 
would thereby not fare any better than the naturalized in the light of 
the modeHtheoretic argument. Although it is true that Pumam does 
not say much about why he thinks his argument only affects the nat- 
uralist, we shall see that, on the reconstruction of the argument to 
be presented, it is perfectly plausible that the distinction should be 
essential in Putnam’s eyes. 

We now turn to the evaluation of Pumam’s argument, and in 
particular to the evaluation of the “just more theory move” 
therein. As intimated earlier, realists have protested vigorously 
against his dismissal of CTR as being just more theory and have 
claimed that the gambit is itself question-begging. Let us pro temp 
suppose that the common interpretation of Putnam's model- 
theoretic argument is correct, and that it correctly locates the 
place of the “just more theory move” in the argument. Then it is 
plain how it is that he begs the issue against the realist: to come 
from ‘TUCTR is TRUE(SAT)’ to ‘TOCTR is true’, Putnam must 
assume (3). But to assume (3) is to assume that CTR fails. For if 
CTR is true, (3) cannot be. He might answer that he does not as- 
sume CTR to be false. Even if CTR is true—he might continue— 
and states a naturalistically acceptable constraint on reference, as 
the realist claims, I, Putnam, have just shown the constraint to be 
insufficient to determine reference. After all, we can find any num 
ber of satisfaction relations on which TOUCTR comes out true, 
can we not? But as virtually any commentator has pointed out, 
this would be to miss a crucial distinction, namely, that between 
an interpretation satisfying some constraint C and an interpretation 


» Cf, also Hale and Wnght, who contend that the “just more theory move” “[1Jf 
allowable ...will be available against any specific constraint the metaphysical realist 


may propose, regardless of its precise content ..” (p. 440). 
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satisfying a theory stating C. Evidently, the realist never claimed 
that reference is fixed by the words of the realist theory of refer- 
ence, but by the actual reference-determining mechanism the the- 
ory is a theory of." 

What should make us suspicious about our understanding of Put- 
nam’s argument here is that, if the standard realist analysis of the ar- 
gument is along the right lines, it would make that argument so 
evidently mistaken. It would mean that Putnam simply takes (3) for 
granted, whereas the dialectical situation quite obviously requires 
him to argue for it. Indeed, the mistake would be so obvious that it 
is hard to imagine a philosopher as astute and logically sophisticated 
as Putnam committing it. 

Could it be, then, that David Anderson” is right that the model 
theoretic argument is not the argument all or most of Putnam’s crit- 
ics have taken it to be, but that the full modeltheoretic argument is 
an argument in two stages: one stage in which Putnam argues that, 
if (3) is the case, then metaphysical realism is false; and another 
stage in which he seeks to defend (3)? His argument for (3), on An- 
derson’s reconstruction, is (in a nutshell) that, if anything could fix 
reference from a naturalist perspective, it is causality, but because 
the only notion of cause we can make sense of (physical or empirical 
cause) is generally overdetermined, causality will not work as a refer- 
ence fixer. 

It is true that at many places in his work, Putnam argues for the 
aforementioned claim, but, as Michael Devitt" rightly remarks, this is 
a separate argument against the realist and not in any sense part of 
the model-theoretic argument. Putnam’s treatment of CTR, in the 
modeHheoretic argument, has nothing to do with the specific con 
tent of CTR;” it has nothing to do with whatever problems might be- 
set the notion of cause or with any other “intrinsic” shortcomings 
CTR might have, and this is so both on the standard interpretation 


H See, for example, Giymour, Lewis; Hale and Wright, Devitt, “Realism and the 
Renegade Putnam,” Nos, xvn (1983): 201-301, Realism and Truth, “On Determin- 
ing Reference,” in Alex Burn, ed., Language and Thought (New York: de Gruyter, 
1997), pp. 112-21; Ian Hacking, Representeng and Intervening (New York: Cambridge, 
1988); Hartry Field, “Are Our Logical and Mathematical Concepts Highly Indeter- 
minate?” Midwest Studws im Philosophy, xix (1994): 391-429; and Willism Alston, A 
Reakst Conception of Truth (Ithaca: Cornell, 1996), p. 141. Hale and Wright suggest, 
and rightly reject, a further response Putnam might make at this point (p. 441). 

4 “What Is the ModeHheoretic Argument,” this JOURNAL, xc, 6 (June 1993): 
$11-22. 

* “On Determining Reference.” 

1Y As we saw, this was also noted by Hale and Wright (pp. 489ff.) and Lewis. 
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of Putnam’s argument and on mine below." Another reason to 
doubt the correctness of Anderson’s reading of the model-theoretic 
argument is that, although he is apparently aware that the argument 
is intended as a reductio of metaphysical realism (op. at, p. 312), on 
his construal it is not (or at least not obviously so): the argument for 
(3), as he presents it, draws on much more than is implied by the 
premises representing realism (whether we take these to be Pl and 
P2 in his paper or the premises CT, MF, and SN of our exposition) .” 

Must we then, after all, conclude that the common reading of Put- 
nam’s modeltheoretic argument is correct and thus that the argu- 
ment rests on an elementary mistake? I think not. The next section 
offers a more charitable rendition of the argument, one on which 
the argument comes out perfectly valid. 

I. THE ARGUMENT RECONSTRUCTED 

On my reading of Putnam’s argument, realists are right that in it he 
assumes CTR to be false. What I would want to deny is that Putnam 
is not entitled to do so. Here I shall argue that there is a way of read- 
ing his argument so that the assumption that CTR is false is legit 
mately made. On the proposed reading, the assumption that CTR is 
false is made in a subargument of the full modeHheoretic argument, 
and is discharged at the end of that subargument. Thereby we ob- 
tain a derivation of ~MF from the uncontroversial premises MF and 
(1) and from the somewhat less uncontroversial (2). It is important, 
however, that (3) is not among the premises. 

Before we go on to consider the alternative reading of Putnam’s 
argument, a few words should be said about the argument’s scope. 
Surely, Anderson is right in pointing out that, according to Pumam, 
causality is the only candidate reference fixer from a naturalistic per- 
spective. Moreover, some realists think likewise.” But as I see it, Put- 
nam’s argument in no way depends on this kind of pessimism about 
naturalistically acceptable alternatives to causality as reference fixer. 


* That Putnam at so many places in his work extensively argues against CTR it- 
self might be interpreted as an indicanon that he himself 1s doubtful about the 
“Just more theory move"—for if that works, it seems that any argument specifically 
directed CTR is otiose. In view of Putnam's general style of ee 
however, it is more plausible to assume that he seeks to show that the 
into trouble at every turn. 

» Moreover, if Putnam’s argument should indeed be intended as Anderson 
claims it is, then I think Manuel Garcfa-Carpintero is nght in pointng out that 
“the specifically modeHheoretic conmderations are irrelevant, and could lead to 
spurious debates"—"The Model Theoretic Argument: Another Turn of the Screw,” 
Erkenntnis, XLIV (1996): 305-16, p. 305 

™ See, for instance, Devitt, "Meanings Just Ain’t in the Head,” in George Boolos, 
ed., Meaning and Method (New York: Cambndge, 1990), pp. 79-104, p. 83, and Com- 
ing to Our Senses, p 160 
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In what follows, we read CTR as asserting that reference is fixed by 
causality or by any other naturalistic constraint." From the ensuing re- 
construction it will emerge that in his argument, he can even make 
the stronger assumption that CTR, read in this way, is false without 
begging any realist issues. 

The basic idea underlying the reconstruction is this: Putnam seeks 
to establish the falsity of MF. To argue toward a reductio, he assumes 
the thesis to be true. Recall now that what this thesis expresses is that 
every scientific theory, no matter how good our evidence for it, may 
be false. Assuming this to be true, it follows that CTR, being an em- 
pirical theory, may be false. It is then obviously legitimate to go on: 
“Suppose it is false...” (that is, to enter a “necessitated subproof,” as it 
is called in some proof systems for modal logic), as Putnam does, at 
least on the interpretation of his argument I favor. The following 
natural-deduction style representation of how the argument-as-I- 
think-ofit goes may be helpful.” For ease of exposition, I assume the 
variable x to range over all and only (actual and potential) consistent 
empirical theories, so that we can formalize MF as Vx0—True(x) 
(that is, the antecedent ‘if x is an empirical theory’-part of MF need 
not be expressed in the formalization of the thesis). 


[7 True(CTR)}? 
D 
ve(Vx0—7True(x)(=MF)]’  True(CTR) [=True(CTR) ]*.5 
u24 æ True (CTR) L, 
glia L(="701) OL 
~L 
~ME 


This is only a first approximation, of course. But it does suffice to 
show that, by reflecting on what MF says, we might be able to get a 
sound derivation of the negation of MF from MF and from whatever 
premises may be hidden inside D. 


x” Jt is not clear whether CTR (however read) can be remed in a first-order 
This might matter, smce Putnam’s argument, at least in its onginal pre- 
sentation, pertains only to first-order theories. In a later presentation (“Models and 
Reality,” p. 23), Putnam argues that the first-order/higherorder disunction is not 
really essential to his argument, although for higher-order languages the argument 
must be different, the same result can be obtained. Putnam may be too quick on 
this issue—cf, Hansen for some sensible comments on this point I shall here side- 
sep that discussion, however, and assume that CTR is a first-order theory. 
The reader unfamiliar with the natural deduction system employed here may 
consult, for example, Neil Tennant, Natural Logic (Edinburgh: University Press, 
1990); or Dirk van Dalen, Logic and Structure (New York: Springer, 1994, 3rd ed.). 
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In a necessitated subproof we allow, apart from the subproof’s hy- 
pothesis, only hypotheses that are necessarily true in the world in which 
all the argument’s premises are true. We so to speak enter a possible 
world?” in which a certain hypothesis—in our case —True(CTR)— 
holds, in which, of course, all tautologies hold, and, most importantly, 
in which everything is true that according to the argument’s premises is 
true in any possible world.™ (1) evidently expresses a necessary truth; it 
can be formalized thus: L] VxdSAT[TRUE(SAT)(x)]. (2) was granted 
to be a conceptual truth, at least from the perspective of the realist. If 
it is, then, from that perspective, it is also necessary, and can be formal- 
ized as: []((3) — Vx GSAT[TRUE(SAT)(x)] <> True(x))). Finally, it 
must be noted that (3) is equivalent to the assumption of CTR’s falsity 
(CTR understood in the “broad” sense defined earlier). Thus, 
~True(CTR)<>(3) is a tautology. 

We are now ready to state the subproof: 

7True(CTR) — (3 
1 [-True(CTR)] -True (CTR) — (3) 
(3 3)—Vx( JSA UE(SAT}(x)] —- True(x 
ax Wx(SSAT[TRUE(SA + True 


va Wx(SSATITRUE(SAT) (x) ] — True(x)) Vx3SATITRUE(SAT) (x) ]ve 
_p SATITRUE(SAT)(CTR)] — True(CTR) SSATITRUE(SAT) (CTR) ] 
_y'[-True(CTR)] __ True(CTR) 
ve 07 True(CTR 1 
Ou. 


What we get when we fit this into the earlier schema is a refutation of 
metaphysical realism which only depends on premises acceptable to 


5 This should not be taken too literally, of only because the assumption that 
there 1s a possible world of a certain kind may lead to 2 contradiction so that there 
can be nothing we might have entered in our subproof. 

™ So the 0 E-rule can alternatively be stated as follows: 


Co)’ Iy]? fy 
Creo wv, pr 
Oy 


where CII abbreviates (fy, ...{o hy. It is easily seen that for QE to be sound no re- 
qurements regarding the accessibility relation have to be made, which means that 
it can serve as a rule in a natural deduction presentation of any modal logic. This 
also holds true for the other modal rule involved in the reconstruction. 

= The same holds, from Devitt’s viewpoint, for CTR understood narrowly. Ac- 
cording to him, causality is the only naturalistically acceptable candidate reference 
fixer, so if CTR (narrowly understood) is false, (3) must be true, and vice versa. 


490 THE JOURNAL OF PHILOSOPHY 


the realist. In particular, (3) is not among the argument’s premises, 
as promised. Furthermore, the reconstruction immediately suggests 
an answer to Lewis’s puzzle why Putnam thinks supernatural con- 
straints are exempt from the “just more theory treatment.” Consider 
the nomnaturalist realist who endorses some magical, nonscientific, 
theory of reference MTR. From the fact that he avows MF, it does not 
follow that he is committed to ‘Possibly, MTR is false’, for MF is a the- 
sis about scientific theories; it says nothing about magical theories. 
Hence, an account of some supernatural power, or even just the 
claim that reference is fixed by supernatural means, though a theory 
in the strict sense of that word (it is a set of sentences, of course), is 
not just more theory in the sense which makes it susceptible to Put- 
nam’s modeltheoretic considerations— it is not just more empirical 
theory. It is indeed the naturalist’s insistence that semantics, properly 
pursued, is a branch of natural science that plays havoc on him; if a 
theory of reference is to be an empirical theory, then it is automat 
cally within the scope of MF. 

Thus, the reading of Putnam’s argument offered here shows the 
argument to be valid, contrary to what most if not all realists believe, 
and would in fact rightly believe if the common reading were the 
only one possible. As a further advantage, the alternative reading 
solves Lewis’s puzzle why Putnam thinks his argument only holds 
against the semantic naturalist. If my reading should not be the ‘in- 
tended one’, then, I think, the former constitute good reasons why 
Putnam should have intended it. 

IGOR DOUVEN 
Utrecht University 
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IS THERE A PLACE FOR PHILOSOPHY IN QUINE’S THEORY ?* 


n the early part of the twentieth century, the logical positivists 
| launched a powerful attack on traditional philosophy, rejecting 
the very idea of philosophy as a substantive discipline and replac- 
ing it with a practical, conventionalist, metatheoretical view of phi 
losophy. The positivist critique was based on a series of dichotomies: 
the analytic versus the synthetic, the external versus the internal, the 
apriori versus the empirical, the metatheoretical versus the object- 
theoretical, the conventional versus the factual. W. V. Quine’s attack 
on the positivists’ dichotomies was, by extension (if not by inten- 
tion), also an attack on their critique of philosophy. In time, how 
ever, his own theory took an extreme naturalistic turn which, if 
anything, deepened the schism between philosophy and knowledge. 
I would like to show that many of Quine’s early philosophical 
ideas—the denial of the analytic-synthetic distinction, the theses of 
the interconnectedness of knowledge, universal revisability, and in- 
separability of language from theory, his methodological pragma- 
tism and realism—are in fact compatible with a substantive 
philosophy. Moreover, certain inner tensions in Quine’s theory nat- 
urally lead to a new model of knowledge in which philosophy plays a 
substantive role: not as a “first philosophy,” or as a “metascience,” or 
as a “chapter in psychology,” but as an independent discipline in its 


* An earlier, abridged version of this article, entitled “On the Place of Philoso- 
phy ın Quine’s Early Theory,” appears in P. Weingartner, G Schurz, and G. Dom, 
cds., The Role of Pragmatics in Contemporary Philosophy: Papers of the 20th International 

Sympostum, Volume N (Kirchberg am Wechsel Austrian Wittgenstein 
Society, 1997), pp. 889-95. 


0022-362X/99/9610/491-524 © 1999 The Journal of Philosophy, Inc. 
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own right, alongside, continuous with, and complementary to, sci- 
ence.’ 
I. AN INNER CONFLICT 


In his 1973 paper, “The Significance of Quine’s Indeterminacy The- 
sis,” Michael Dummett’ makes the following provocative claim: 
Quine rejected the analytic-synthetic distinction in “Two Dogmas of 
Empiricism” (1951) on the ground that no coherent and philosophi 
cally significant content has ever been assigned to it Yet, ironically, 
the distinction has come to life for the first time in the very account 
that was meant to replace it: Quine’s holistic model of knowledge 
with its center-periphery structure. Dummett’s claim is based on two 
observations: (1) Quine’s center-periphery model provides us with 
notions in terms of which the analytic-synthetic distinction is well de- 
fined. Roughly, analytic statements are immune to experience, syn- 
thetic statements are vulnerable to experience (that is, their revision 
would be mandated by suitable experiences). (2) This version of 
the distinction cannot be rejected as empty, since the claim that 
there are no synthetic statements and no analytic statements (in the 
above sense) conflicts with the center-periphery model. Conclusion: 
Quine’s claim that the analytic-synthetic distinction is unfounded 
(henceforth, the negative analytic-synthetic thesis or NAS) conflicts with 
his claim that our body of knowledge has a center-periphery struc- 


' For the positivists’ view of philosophy, see a a The Logical Structurs of 
the World and Pssudoprobiems in Philosophy, R.A. George, trans. (Berkeley. California 
UP, 1967), The Logical Syntax of Language, A. Smeaton, trans. (London: Kegan Paul, 
1937); Foundations of Logic and Mathematics, International Encyclopedia of Unified 
Saec, Volume I, Number 3 (Chicago: University Press, 1939), “Empinasm, Se- 
mantics, and Ontology,” in Meaning and Necessity: A Study in Semanis and Modal 
Loge (Chicago: University Press, 1956, second en Pp. 205-21; AJ. Ayer, Lan 

Truth and Logic (London: Gollancz, 1946, second edition). By ‘Quine’s pa 
ideas’ I mean Quine’s views in “Two Dogmas of Empiricism,” in Mom a 
of View (Cambridge: Harvard, 1980, second edition), pp. 20-46; “Truth by re 
tion,” in The Ways of Paradox and Other Essays (Cambridge: Harvard, 1976, rev ed), 
pp. 77-106; “Carnap and ical Truth,” ın The Philosophy of Rudolf Carnap, P.A 
Schilpp, ed (La Salle: Open Court, 1963), pp. 385-406; and other articles from the 
same period. Ea SiE T poe nea. Nie EE of Qunne’s ideas point in the dt 
rection of a substantive philosophy for example, Paul Gochet, Ascent to Truth. 
A Cniical Examenation of Quine's Philosophy (Munich: Philosophia, 1986), p. 16. See 
also George D. Romanoe’s analysis of Quine’s conception of philosophy in Quine 
and Analytic Philosophy: The Language of Language (Cambridge: MIT, 1983), pp. 187-95. 
Romanoe’s analysis, however, 1 quite different from mine. 

* In Truth and Other Enigmas (Cambridge: Harvard, 1978), pp. 375-416. 

* In Dummett’s words: “an analytic sentence is one such no recalcitrant ex- 
perience would lead us to withdraw our assignment to it of the value true, while a 
synthetic one is one such that any adequate revision prompted by certain recalct 
trant experiences would pe ings our withdrawing an assignment to it of the value 
true” (isd, p 375). 
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ture (henceforth, the center-pertphery thesis or CP*). I shall not discuss 
Dummett’s account of the conflict here: for one thing, my point of 
view is somewhat different from his; for another, I have some diff- 
culty with certain aspects of his argument. Instead, I shall present 
the conflict in my own way and from my own perspective, namely, as 
a conflict between two epistemic methodologies: the one associated 
with NAS, the other with CP.' 

A methodological conflict. The most basic dichotomy underlying 
the analytic-synthetic distinction is that of statements grounded in 
matters of fact and statements grounded in something other than fact. 
What this other thing is varies from one philosophical theory to an- 
other, but Quine’s attack applies equally to older as well as to newer 
renderings of the distinction. Methodologically, the most important 
feature of the analytic-synthetic thesis is the bifurcation of our stan- 
dards of knowledge: the standards for the acceptance, Justification, re- 
vision of synthetic items of knowledge are different from the 
standards for the acceptance, justification, revision of analytic items. 
Quine’s revolution consists in rejecting this bifurcation: every item 
of knowledge is subject to both standards—logic may be revised in 
response to experience, observation statements may be preserved 
based on pragmatic considerations. 

Now, CP breaks this uniform picture of knowledge by positing a 
structure with two essentially different, if interconnected, zones: an 
inner zone—center-—-based on one set of standards, and an outer 
zone— pertphery— based on another. Elements in the center are ma 
nipulated using pragmatic standards, elements in the periphery us- 
ing evidential (experiential) standards. Elements located in the 
periphery stand in a privileged relation to reality from which ele- 
ments located in the center are excluded. Thus, granted that a con- 
flict with experience can be resolved by changes anywhere in the 
system, there is still an essential difference between changes occur- 
ring in the center and changes occurring in the periphery. The 
ground for changing statements in the periphery is the need to 
square these statements themselves with experience; the ground for 
changing statements in the center is the need to square other state- 
ments, namely, statements in the periphery, with experience. Truths 


* Ishall use ‘CP’ both for the “center-pertphery thess” and for the “center-periphery 


* In addition to “The Significance of Quine’s Indeterminacy Thesis,” Dummett 
also discusses the conflict in Frege. Philosophy of Language (New York. Harper, 1973), 
chapter 17. By offering my own version of the conflict, I do not wish to diminish 
my debt to Dummett; needless to say, he 1s not responsible for my version. 
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located in the center are held solely in virtue of their contribution to 
the overall working of the system, hence due to pragmatic considera- 
tions. Truths located in the periphery are held partly because of 
their contribution to the system as a whole, but largely, and most im- 
portantly, due to their own direct link to experience, that is, based 
on factual, objective considerations. Take logic, for example. There 
is no room in Quine’s model for a conflict between a logical state- 
ment, /, and an external event, ¢ analogous to, say, the conflict be- 
tween the peripheral statement, ‘It rained in San Diego on Monday, 
March Ist, 1999’, and the event of sunny skies in San Diego on that 
day. Reality never impinges as directly on logic as it does on experi- 
mental science, and logic, therefore, never occupies the periphery. 
We may change our logic in response to conflicts with experience, 
but only pragmatically and via a chain of connections starting and 
ending in the periphery. The bone of contention between NAS and 
CP is the role of fact and convention in knowledge: while a princi- 
pled division of knowledge into conventional and factual con- 
stituents is denied by NAS, such a division is affirmed by CP. By 
superimposing a center-periphery structure on the unified domain 
of knowledge postulated by NAS, Quine has de facto incorporated 
certain significant elements of the analytic-synthetic dichotomy into 
his epistemology. 
Il. OBJECTIONS 

Before turning to possible solutions, I would like to consider three 
likely objections to the reality of the conflict.’ 

A. Objection one. CP cannot be said to reproduce the analytic- 
synthetic distinction, since this distinction requires an absolute divi 
sion of statements into two mutually disjoint groups, but the center- 
periphery model represents our body of knowledge as a continuum. 
The center-periphery duality is gradual, while the analytic-synthetic 
dichotomy requires sharp boundaries. 

Response. First, I would like to emphasize that on my construal, the 
center-periphery duality is not said to reproduce the analytic-synthetic 
distinction in its entirety. At issue is a significant overlap of the two 
distinctions, not a complete identity (or inclusion). 

Second, the claim that the analytic-synthetic distinction does not 
allow a graduated progression may be true to its letter but is not true 


* Of course, there 1s no limit to the number of objections that can be raised. I 
have selected three objections which, in addition to their intinsic merits, provide 
me with the opportunity to elucidate relevant issues. 
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of its use, especially at the hands of the logical positivists. Explana- 
tion: traditionally, the analytic-synthetic distinction is a distinction 
between whole statements; but for the logical positivists, a more basic 
distinction exists between constituents of statements. This secondary 
distinction plays an especially important role in Rudolf Carnap’s re- 
ductionist project in The Logical Structure of the World (op. at). Scr 
ence, in Carnap’s construction, is built out of basic elements of two 
types: purely experiential elements (in Carnap’s construction, “ele- 
mentary experiences” and the primitive relation of “recollection of 
similarity”), and purely conventional elements (logico-mathematical 
operations and the linguistic operation of definition). The construc- 
tion proceeds in a hierarchy of stages: in stage one, purely conven- 
tional operators are applied to purely experiential elements, 
generating mixed elements of level one; in stage two, conventional 
operators (and mixed operators, if any, obtained in stage one) are 
applied to elements of level one, generating mixed elements of level 
two, and so on. We progress in this way from elementary experi- 
ences to pairs of elementary experiences, equivalence classes of ele- 
mentary experiences (under the relation of recollection of similarity 
relative to a given agent), and so forth. Somewhere along the way 
we obtain qualities (quality structures), observable physical objects, 
theoretical objects, and so on. The result is a continuum of ele- 
ments and, corresponding to it, a continuum of statements. If we or- 
der these statements according to their balance of experiential to 
conventional constituents (based on some criterion or another), we 
arrive at a progression of statements stretching from the purely syn- 
thetic to the purely analytic. We could, of course, draw a line (mark 
a region) somewhere along the continuum: statements on one side 
of the line (inside the designated region) would be “absolutely” ana- 
lytic, all the rest “absolutely” synthetic; but such a division would still, 
at bottom, be graduated. 

Consider Carnap’s analysis of a simple synthetic statement, say, 
‘All dogs are carnivores’. This statement is doubly mixed: on the 
one hand, it contains both analytic and synthetic constituents (the 
analytic operators ‘all’ and ‘if...then’, and the synthetic predicates 
‘dog’ and ‘carnivore’); on the other hand, its synthetic constituents 
themselves are decomposed into analytic and synthetic subcon- 
stituents. The concept “dog,” for example, is constructed by succes- 
sive applications of analytic operations, first, to purely synthetic 
elements and, then, to mixed elements, generating a gradual pro- 
gression of concepts (and, correspondingly, objects) from the purely 
synthetic to the increasingly analytic. Speaking objectually, this pro- 
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gression includes: 


(1) Elementary experiences and the relation of recollection of similanty 
between them (primitive). 

(2) “Dog” sense qualities: for example, the color brown (equivalence 
classes of elementary experiences under the relation of recollection 
of similarity). 

(3) “Dog” sense -qualities-at-space-time-locations: for example, brown in 
the kennel at t (n-tuples whose first four elements are real 
numbers— representing space-time locations-—followed by one or 
more “dog” sense qualities). 

(4) “Dog” points of the perceptual world (classes of “dog” sense-qualities- 
at-a-location). 

(5) “Dog” States (classes of dog perceptual points). 

(6) Individual dogs (classes of “dog” states). 

(7) The general object or species Dog (the class of all individual dogs).’ 


What, then, is the conflict between Carnap’s graduated progression 
and Quine’s NASP The conflict consists of at least three disagree- 
ments. (a) According to Carnap, our body of knowledge is ulti- 
mately built out of purely experiential and purely conventional 
elements; according to Quine, no constituent of knowledge is either 
purely experiential or purely conventional. (b) According to Car- 
nap, each element is divided into constituents whose 
experiential/conventional status is uniquely determined; according 
to Quine, no element is uniquely divisible in this way. (c) According 
to Carnap, the application and satisfaction conditions of some ele- 
ments are purely conventional while those of others purely factual 
(experiential); according to Quine, the application and satisfaction 
conditions of all elements are both factual and conventional. 

B. Objection two. The center-periphery methodology differs from 
the analytic-synthetic methodology with respect to what is arguably 
the most important principle of Quine’s theory: the principle of uni- 
versal revisability. The analytic-synthetic methodology views analytic 
statements as immune to revision, synthetic (empirical) statements 
as linked to a range of experiences whose occurrence would (under 
appropriate conditions) compel their revision. In contrast, the 
center-periphery methodology (like the NAS methodology) views no 


7 See The Logical Structure of the World, pp. 247-48 A more detailed account of 
the constructional stages of physical objects (and their concepts) appears on pp. 
175-211 
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statement as immune to revision and the revision of no statement as 
compelled by any (specific) experiences. Given this major gap be- 
tween the two methodologies, any similarity between them is of m+ 
nor significance. 

Response. The claim that the analytic-synthetic distinction is tied to 
the traditional revision principle is simply incorrect.’ This claim is 
refuted if not by the classical, Kantian model, then by the positivist 
model. In this model, analytic statements are based on linguistic 
conventions, and these are all too easy to replace. Given an infirm- 
ing experience (or a series of infirming experiences), the positivist 
model allows a wide array of reactions: (a) no revision; (b) revision 
of synthetic statements: (i) directly implicated synthetic statements 
(observational or “protocol” statements), (ii) indirectly implicated 
synthetic statements (theoretical statements); and (c) revision of an- 
alytic statements (logico-linguistic conventions and/or framework 
principles). 

Thus, discussing a case in which a protocol sentence deduced 
from a physical theory is disconfirmed by experiment, Carnap says: 


There are no established rules for the kind of change which must be 
made. For instance, the P{hysical]-rules can be altered in such a way 
that those particular primitive sentences are no longer valid; or the 
protocol-sentence can be taken as being non-valid; or again the 
L[ogicallinguistic]-rules which have been used in the deduction can 
also be changed.’ 


Elsewhere, Carnap describes a hypothetical situation in which an 
empirical discovery would naturally lead to a conventional revision: 


If certain events allegedly observed in spiritualistic séances, e.g, a ball 
moving out of a sealed box, were confirmed beyond any reasonable 
doubt, it might seem advisable to use [the conventional device of] four 
spatial coordinates [instead of the customary three]." 


But a conflict with experience, Carnap says, is also compatible with 
no revision: 


(Wihen [a scientific hypothesis] proves to be L-incompatible with cer- 
tain protocolsentences, there always exists the possibility of maintain- 
ing the hypothesis and renouncing acknowledgement of the 


* Quine himself appears to endorse this clam (see “Two Dogmas,” p. 43). Hew 
wrong, in my view, unless he has in mind a restricted version of the claim (re- 
stricted, for example, to Immanuel Kant’s vernon of the distinction or to the posi 
tivist version from a purely mternal perspective) 

” The Logical Syntax of Language, p. 317. (The two sentences are transposed ) 

* “Empiricism, Semantics, and Ontology,” pp 212-18 
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protocolsentences." 


Finally, Quine’s principle of universal revisability is explicitly en- 
dorsed by Carnap: 


[In “Two Dogmas”] Quine showas...that a scientist, who discovers a con- 
flict between his observations and his theory and who 1s therefore com- 
pelled to make a readjustment somewhere in the total system of 
science, has much latitude with respect to the place where a change is 
to be made. In this procedure, no statement is immune to revision, not 
even the statements of logic and of mathematics.... With...this I am en- 
tirely in agreement.” 

Where Carnap’s liberal policy clashes with Quine’s is in its strict bt 
furcation of the standards for revision: purely pragmatic (practical) 
standards for the revision of logico-mathematical statements; largely 
factual (theoretical, evidential) standards for the revision of physical 
statements.” 

C. Objection three. There is no real conflict between CP and NAS, 
since the two theses operate on different levels. While CP is a de- 
scriptive, behaviorist thesis, NAS is a prescriptive, normative thesis. CP 
describes scientists’ habitual practices, including their tendency to 
regard certain statements as conventional, others as factual and still 
others as occupying intermediate positions between the two ex- 
tremes; NAS says that scientists’ habitual practices are not inviolable 
principles of rational inquiry. Clearly, there is no conflict between 
the claim that scientists normally act in accordance with the analytic- 
synthetic distinction and the claim that scientists are not normatiuely 
bound by this behavior. 

Response. This is an important objection whose validity depends 
on the role assigned to CP in Quine’s argument. To identify this 
role, I propose the following analysis of the general structure of 
“Two Dogmas of Empiricism.” The paper is divided into two parts: 
sections one to five, and section six, respectively. The first part 
consists of a series of arguments to the effect that (a) the analytic- 
synthetic distinction is not well defined, and (b) the analytic-syn- 
thetic distinction is connected with a failed epistemology (namely, 


! The Logical Syntax of Language, p. 318. 

a “Replies and Systematic Expositions,” in The PaAtslosophy of Rudolf Carnap, pp. 
859-1018, here p. 921. 

* For an especially sharp version of this bifurcation, see Carnap's “Empiricism, 
Semantics, and Ontology.” 
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reductionism). None of these arguments conclusively demonstrates 
the unacceptability of the distinction. The first argument (cluster of 
arguments) merely shows that past attempts to define the distinction 
have failed, not that any attempt to define it is bound to fail; the sec- 
ond argument shows that a popular epistemology incorporating the 
distinction suffers serious difficulties, not that any epistemology in- 
corporating it is bound to suffer these difficulties, or that these diff- 
culties are in principle insurmountable. In spite of the inconclusive 
nature of these arguments, Quine arrives at a radical and uncom- 
promising conclusion: the analytic-synthetic distinction should be 
altogether and in principle banished from philosophy. 

Quine’s “jump” from the claim that past attempts to establish the 
distinction face serious difficulties to the claim that the distinction is 
(in principle) an untenable dogma is obviously not a step in deduc- 
tive reasoning. Rather, it is a methodological change of direction, 
similar to other methodological shifts in the history of philosophy: 
for example, Immanuel Kant’s so-called “Copernican” revolution. 
We could, indeed, use roughly the same reasoning as Kant’s (or the 
same skeleton of reasoning) to close the gap in Quine’s argument: 


Hitherto it has been assumed that all our knowledge must be reducible 
to ultimate elements of two kinds—analytic and synthetic. But all our 
attempts to provide a solid foundation for knowledge based on this divi- 
sion have ended in failure. We must therefore make trial whether we 
may not have more success in the task of epistemology, if we suppose 
that knowledge is not divisible in this way.” 


Quine’s rejection of the analytic-synthetic dichotomy, like Kant’s 
rejection of the traditional world-observer dichotomy, is a promis- 
sory note, to be made good by a new model of knowledge which 
demonstrates the virtues of the new methodology. The construction 
of such a model is undertaken in the second part (section six) of 
“Two Dogmas,” where Quine draws an outline of a new epistemic 
model, CP, whose task is to vindicate NAS. Quine is quite clear 
about the intended role of CP. Immediately following the introduc- 


H For exposinon and critical discusmon of these arguments, see Gochet; H.P. 
Gnce and P.F. Strawson, “In Defence of a Dogma,” The Philosophical Renew, LXV 
(1956): 141-58; Roger F. Gibson, The Philosophy of WV Quine An Expontory Essay 
(Tampa: South Florida UP, 1982); Paul A. Boghoaman, “Analytcity Reconsidered,” 
Notts, xxx (1996): 360-91. 

* An adaptation of a passage in Gnagque of Pure Reason (1787, second edition): 
“Hitherto it has been assumed that all our knowl must conform to objects. 
But all attempts to extend our knowledge of objects by establishing something in 
regard to them a pron, by means of concepts, have, on this assumption, ended in 
failure. We must therefore make trial whether we may not have more success in 
the tasks of metaphysics, if we suppose that objects must conform to our knowl 
edge” (Bxvi)—~Norman Kemp Smith, trans. (London: Macmillan, 1929). 
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tion of the model, he announces its direct implicational relation to 
NAS: 


If this view [the view of knowledge represented by CP] is right,...1t be- 
comes folly to seek a boundary between synthetic statements...and ana 
lytic statements...."* 


Furthermore, tenets central to NAS follow from the model: 


...1t is misleading to speak of the empirical content of an individual state- 
ment... Any statement can be held true come what may, if we make 
drastic enough adjustments elsewhere in the system.... Conversely, by 
the same token, no statement is immune to revision... (2ad.).”” 


It is evident that Quine regards CP as a realization of NAS. The 
model is designed to justify NAS by example, not by counterexam- 
ple. CP and NAS are not two independent theses, the one descrip- 
tive, the other normative, the one capturing one aspect of our 
system of knowledge, the other, another. Rather, the two theses are 
designed to complement one another: CP by providing an attractive 
picture of knowledge in accordance with NAS, NAS by motivating 
and articulating the normative principles underlying CP. Since NAS 
is a normative thesis, CP is assigned the role of a normative model 
exemplifying its principles.” 

If this analysis is correct, the third objection is dissolved: CP is a 
normative thesis and its conflict with NAS is a genuine conflict, 
threatening the integrity of Quine’s project. In the next section, I 
shall consider, and reject, two eliminative solutions to this conflict. 
In the last two sections, I shall present a third, constructive solution, 
with substantial ramifications for philosophy. 

Il. TWO SOLUTIONS 
I shall begin with two eliminative solutions: (A) withdraw NAS, (B) 
withdraw CP. 


'¢ “Two Dogmas of Empiricism,” p. 48. 

™ The revisability principle, as we have seen above, is not unique to NAS, but it 
still constitutes an important component of NAS. 

* One may try to argue for a descriptive readmg of CP based on the third para- 
graph of section 6: “For vividness I have been speaking...” (op. at, pp. 43-44). But 
even if ıt is natural to read a few sentences in this paragraph as suggesting a descrip- 
tive model, my argument is not affected. What I have attempted to do is not to show 
that no sentence ın “Two Dogmas” is compatible with a descriptive model. Rather, 
I have attempted to show that (i) a descriptive model of knowledge would be extra- 
neous to Quine’s main argument in “Two ,» (ii) the main argument of 
“Two Dogmas” calls for a normative model, and (i) there is strong textual evidence 
to support a normative construal of the model. In contrast, interpreting CP as a de- 
scriptive model would raise difficult questions with respect to the mterpretation of 
Quine’s overall argument, the contribution of CP to this argument, and the role of 
the indicated passage in an otherwise normative section, sandwiched between two 
clearly normative passages. My tendency is to view the third paragraph as a digres- 
sion, possibly foreshadowing Quine’s later epistemology. 
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A. First solution: withdraw NAS. This solution is recommended by 
Dummett. Quine’s main achievement in “Two Dogmas of Empiri- 
cism,” according to Dummett, is a new verificationist model of lan- 
guage (knowledge) which, by being moderately holistic (‘organic’, 
in Dummett’s terminology), avoids the pitfalls of earlier models. 
Since NAS threatens to undermine this model, Dummett recom- 
mends the elimination of NAS. He finds support for his proposal in 
Quine’s own “change of heart” in Word and Objed and later writ- 
ings, where, according to Dummett, Quine has “quietly dropped” 
NAS.” 

Disputation. I shall offer three reasons— exegetical, historical, and 
substantive—for rejecting this solution. The second reason I shall 
merely allude to, the other two I shall discuss in more detail. 

(1) Exegesis. (a) Early writings. It is clear that prior to Word and Ob- 
pect, Quine’s espousal of NAS was full hearted and uncompromising. 
Note the sharp tone and disparaging expressions in the following ci- 
tations from “Two Dogmas”: “for all its a priori reasonableness, a 
boundary between analytic and synthetic statements simply has not 
been drawn. That there is such a distinction to be drawn at all is an 
unempirical dogma of empiricists, a metaphysical article of faith”; 
“jt is nonsense, and the root of much nonsense, to speak of a lin- 
guistic component and a factual component in the truth of any in- 
dividual statement”; “If this view is right,...it becomes folly to seek a 
boundary between synthetic statements...and analytic statements” 
(op. cit., pp. 37, 42, 43). In other articles from the same period, 
Quine’s message is just as strong: “My misgivings over the notion of 
analyticity are... misgivings in principle”; “analyticity...is a pseudo- 
concept which philosophy would be better off without."* In con- 
trast, Quine’s advocacy of the center-periphery duality is somewhat 
qualified: the idea of a distance from a center and a periphery is 
merely a “metaphor,” brought in for the sake of “vividness” and not 
to be taken quite literally.” There is no doubt that faced with a de- 
cision between the two theses, Quine of “Two Dogmas” would have 
chosen NAS. 

(b) Later writings. In spite of endorsing a few watered-down versions 
of the analytic-synthetic distinction in some of his later writings, 


» Cambridge: MIT, 1960. 

* See Dummett, “The Significance of Quine’s Indeterminacy Thesis,” pp. 
377-78. 

"= “Mr. Strawson on Logical Theory” and “Three Grades of Modal Involvement” 
in The Ways of Paradox and Other Essays, pp. 137-57 and pp. 158-76, here p. 189 and 
p. 171, rexpectively. 

™ See “Two Dogmas,” p. 43. 
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Quine has never wavered in his rejection of the fullblooded distinc- 
tion. Characterizing his own version in Word and Object (stimulus ana- 
lytic versus stimulus synthetic) as a “behavioristic ersatz” and a 
“vegetarian imitation” (op. cit, pp. 66, 67), Quine declares that the 
fullscale distinction “only encourages confused impressions of how 
language relates to the world” (op. cit., p. 67). Then, commenting on 
the customary grouping of “synthetic truth,” “fact,” “objectivity,” and 
“accessibility to observation,” he says: “So here that same analytic- 
synthetic dichotomy intrudes which we have found so dubious” (op. 
cit, p. 247). And as usual, Quine is highly critical of Carnap’s uses of 
the distinction: “Carnap has long held that the questions of philoso- 
phy, when real at all, are questions of language.... But why should 
this be true of the philosophical questions and not of theoretical 
questions generally? Such a distinction of status is of a piece with the 
notion of analyticity..., and as little to be trusted” (op. cit, p. 271). 
Quine’s impatience with “analyticity” is once again revealed in Phaloso- 
phy of Logic ®: “it is time to rein in our verbalism. What are we trying 
to get at when we call a sentence analytic, or true purely by virtue of 
the language” (ibid, p. 96)? In The Roots of Reference,“ Quine contin 
ues to emphasize the difference between his own version of the dis- 
tinction and Carnap’s: “we have here no such radical cleavage 
between analytic and synthetic sentences as was called for by Carnap 
and other epistemologists” (ibid., p. 80). And as late as Pursuit of 
Truth,” Quine says: “The importance of analyticity for epistemology 
lapses, be it noted, in the light of holism” (iid, pp. 55-56). There is 
no question that Quine’s commitment to NAS has persisted through 
the years. 

(2) Historical perspective. Historically, the analytic-synthetic distinc- 
tion is tied to a long chain of philosophical dichotomies: the apriori 
versus the aposteriori, the necessary versus the contingent, reason 
versus experience, reality versus appearance, objectivity versus sub- 
jectivity, knowledge versus opinion, and so on. It is hard to exagger- 
ate the role of these dichotomies in initiating philosophical 
movements: rationalism, skepticism, idealism, empiricism, pragma- 
tism, and positivism have all been driven by some or all of these di 
chotomies. In repudiating the analytic-synthetic distinction, NAS 
calls into question this entire line of philosophical distinctions. 


= Englewood Cliffs: Prentice Hall, 1970. 
™ La Salle: Open Court, 1973. 
* Cambridge: Harvard, 1990. 
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From a historical perspective, therefore, NAS is a revolutionary the- 
sis, challenging some of the most fundamental assumptions of phi- 
losophy and offering an altogether new approach to basic 
philosophical problems. In contrast, CP treads on well-known 
grounds, being a variant of the logical positivist account of knowl 
edge and an offspring of classical empiricism. From the point of 
view of their critical as well as their innovative potential, NAS is un- 
questionably of far greater philosophical interest than CP. 

(3) Strength of thests. I shall present an outline of three epistemic 
arguments in support of NAS. 

(a) A sound methodological principle. One of the consequences of the 
analyticsynthetic dichotomy is a rigid methodology of confirmation 
and infirmation: synthetic truth is a matter of fact and is therefore con- 
firmed or infirmed by whatever counts as factual evidence with respect 
to it; analytic truth is a matter of something other than fact, hence 
there is no such thing as factual evidence for (or against) it. Synthetic 
statements are justified by appeal to the world; analytic statements by 
appeal to X, where Xis language, method, convention, thought, mind, 
and so on. Synthetic truths are vulnerable to experience, analytic truths are im- 
munes to experience. This methodological principle has never been ade- 
quately established. Indeed, one of the lessons that history has taught 
us is that no statement is immune to experience, and many revisions 
blur the line between the factual and the pragmatic: “Revision even of 
the logical law of excluded middle has been proposed as a means of 
simplifying quantum mechanics; and what difference is there in prind- 
ple between such a shift and the shift whereby Kepler superseded 
Ptolemy, or Einstein Newton, or Darwin Aristotler”™ 

Hilary Putnam’s” analyses of three episodes in the history of sci 
ence—the replacement of Newton's definition of kinetic energy by 
Einstein's, the passage from Euclidean to non-Euclidean physical 
geometry, and the development of a nonstandard logic for quantum 
mechanics—further support Quine’s claim. Putnam concentrates 
on the factual element in the revision of “analytic” items. The revit 
sion of definitions, logical laws, and mathematical theories, accord- 
ing to Putnam, does not have to be conducted on a separate level of 


™ “Two Dogmas of Empincism,” p 

r “The Analyne and the Sae in Mind Language and Reality. Philosophical 
Papers, Volume 2 (New York: asain 1975), pp. 38-69, and “The Logic of 
Quantum Mechanics” in Mathematics, Matter and Method: Philosophical Papers, Vol- 
ume 1 (New York. Cambridge, 1979, second edition), pp. 174-97. 
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inquiry (presumably, the level of metatheory) or guided by a special 
set of norms (presumably, purely pragmatic norms). Rather, in the 
course of scientific change, a definition may come to be treated as an 
identity statement (a law of nature),” an abstract geometry as a physical 
theory (a theory of the geometrical features of physical space), and a 
logic as a method of reasoning for a particular formal structure (a 
structure of objects possessing certain formal characteristics). In this 
way, the laws of logic are, in principle, vulnerable to the world: if the 
formal behavior of objects conflicts (in some “deep” way) with a 
particular logic, then this logic is challenged by the behavior of those objects.” 

Using the metaphor of a battle between nature and humanity to 
represent the pursuit of knowledge,” we can describe the method- 
ological rationale for NAS as follows: the analytic-synthetic di- 
chotomy creates a false line of defense against nature. Nature, the 
analytic-synthetic dichotomy induces us to believe, is in principle in- 
capable of threatening the analytic zone of our knowledge. But na 
ture might (and some say, does) find ways to encroach upon this 


zone. The analytic-synthetic policy of complacency in the analytic 
zone, careful measures for establishing the correctness of our theo- 
ries in the synthetic zone, is therefore unwarranted. We do not 
know in advance where nature will choose to strike next, and by re- 
stricting our defenses to the synthetic front of knowledge, are we not 
creating an epistemic Maginot line? As a matter of prudent strategy, 


* Putnam uses Albert Einstein's definition of kinetic energy to exemplify the 
similarity between definiuons and laws. The definition of kinetic energy, accord 
ing to Putnam, is motivated by the same kind of conmderations and has the same 
kind of role in Einstein's theory as such bona fide laws as ‘Moving clocks slow 
down’ and ‘Nothing exceeds the speed of light’—"The Analytic and the Synthetic,” 
pp. 42-46. The treatment of definitions as identity statements 1s also familiar from 
logical semantics. If T and T’ are theones form ulaked within the framework of 
standard logic, and T differs from T’ only in having ‘a py f$) where T' has‘a= 
f$) (or m having ‘Ax = p Bx’ where T’ has ka (Na Ar Bx}'), then T and T’ 
have exactly the same models, exactly the same logical consequences, and so on, 
that is, Tand T’ are model- -theoretically indistinguishable. 

* This way of looking at logic is common among quantum logicians. The rules 
of classical logic, these logicians argue, are “obeyed” by systems of objects embedd- 
able in a Boolean structure, but systems exhibiting other formal structures, for ex- 
ample, the structure of a nondistributive complemented lattice, require a different 
logic. See Pumam, “The Logic of Quantum Mechanics"; David Finkelstein, “Mat- 
ter, Space and Logic,” in Robert S Cohen and Marx W Wartofsky, eds., Boston 
Studs wn the Philosophy of Samca, Volume 5 (Boston. fone 1969), pp. 199-215; 
and R.I.G. Hughes, The Structure and Interpretation of Quantum Mechanics (Cam- 
bridge. Harvard, 1989), chapter 7. This way of looking at logic may also underhe 
Alfred Tarski's statement that “certain ..experiences of a very fundamental nature 
may make us inclined to change just some axioms of logic’--Morton White, “A 
Philosophical Letter of Alfred Tarski,” this JOURNAL, LXXXIV, 1 (January 1987): 
28-32, here pp. 31-82. 

“ The metaphor is taken from Jaakko Hintikka; see, for example, his 

“Language-Games for Quantifiers,” in Logic, Language-Games and Information: 
Kantian Themes in the Philosophy of Logic (New York: Oxford, 1973), pp 53-82. 
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it is incumbent upon us to maximize the maneuverability of our cog- 
nitive resources, and just this is accomplished by NAS. 

One possible objection to this line of reasoning can be formulated 
as follows: granting the unpredictability of nature, it is still not clear 
how nature can surprise us when it comes to our own concepts. If the 
concept of A includes the concept of B, what can nature do about it? 
How can nature falsify our judgment that every A is a B? This chab 
lenge can be expressed both in Kantian and in positivist terms: in 
Kantian terms, the claim is that some truths are grounded exclusively 
in meaning; the positivist claim is that some statements are true by 
postulation. Quine’s response to the positivist challenge appears in 
“Carnap and Logical Truth” (1954) where he argues that viewing any 
statement as true by postulation is committing a genetic fallacy. 

(b) The genetic fallacy argument. The genetic fallacy is, traditionally, 
a fallacy of relevance: the genesis of a statement Z has nothing to do 
with the rationality of upholding Z. Quine’s genetic fallacy argu 
ment proceeds as follow. Suppose a statement Z made it into our 
corpus of knowledge on grounds of analyticity. What is the signif- 
cance of this? It shows that at a particular moment in the history of 
our corpus, Z was accepted based on postulation. But this has noth- 
ing to do with the reasons for maintaining (or readmitting) Zata 
later stage in the development of our corpus. Analyticity and syn- 
theticity are no more than genealogical properties of statements— 
significant historically, but not normatiuely. Normatively, our task is to 
develop a correct theory of the world. Eventually, our postulates 
must earn their place in our theories not just in virtue of being con- 
venient but also in virtue of being true.” 

A similar argument by J.G. MaaB™ was presented in 1790 against 
Kant’s version of the analyticity claim. Whether a given statement 
satisfies Kant’s conditions on analyticity varies, according to MaafB, 
from speaker to speaker and from one occasion of speech to another: 

= The ent of establishing the truth of postulates is commonplace, ac- 
cording to e: any theoretcal hypothesis is, at the time it is made, no more 
than a conventional postulate (being justified by nothing more than “the elegance 
or convenience...[it] brings to the containing body of laws and data”), and there- 
fore the process whereby we test it for agreement with experience is just the 
process whereby a conventional postulate is tested for tuth—“Carnap and Logical 


Truth,” p. 396. The need to prove the truth of postulates arises also in mathemat- 
ics, where postulates are often thought of as obvtoush (rather than conventionally) 
true. Thus, Georg Cantor, for example, held the wellordering prinaple to be a 
self-evident postulate, only later to require a ngorous proof of this pnnciple. See 
Gregory H. Moore, Zermelo’ Axiom of Choe: Its Ongms, Development, and 

(New York: Springer, 1982), sections 1.5 and 1.6. 

R MaaB’s article, “On the Highest Principle of Synthetic Judgments in Relation 
to the Theory of Mathematical Certainty,” appeared in the second volume of the 
ant-Kantian journal Phtlosophiuckes Magaun, see Henry E. Allison, The Kant-Eberhard 
Controversy (Baltimore: Johns Hopkins, 1973), pp. 41-45. 
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some speakers arrive at ‘A is B’ by being taught that the concept 
of B is included in the concept of A, others, by observation; 
some speakers extend their knowledge by adding ‘A is B’ to 
their corpus, others (or the same speakers on other occasions) do 
not. Kant (through a review by J.G. Schulze™) rejected Maap’s 
criticism on the ground that relations between concepts are fixed: 
we can change the reference of our words, but not the reference 
of our concepts. Quine, however, denies the “fixity” of concepts, 
and his own account provides a new theoretical basis for MaapB’s 
criticism. 

(c) The nonfixity of concepts. The traditional analytic-synthetic dis- 
tinction involves the assumption that our concepts are fixed: there is 
a fixed fact of the matter concerning the content (meaning, refer- 
ence, composition) of our concepts, hence there are immutable re- 
lations between them; theories change, concepts are permanent. 
Against this view, Quine introduces his principle of the nonfixity of 
concepts. This principle is closely related to his thesis of the insepa- 
rability of language from theory, according to which the meaning of 
terms is inextricably tied with theories in which they occur: the 
meaning of ‘atom’ with theories of matter, the meaning of ‘light’ 
with optical theories, and so on.” It follows from this thesis that 
change in theory involves change in the content of our concepts, 
that is, the content of our concepts is not fixed. 

Quine’s thesis of the nonfixity of concepts is deepened and ex- 
panded by Putnam in “The Analytic and the Synthetic.” Putnam es- 
tablishes the nonfixity of concepts on two grounds: (i) a 
Wittgensteinian analysis of certain scientific concepts as (in effect) 
“family resemblance” concepts, and (ii) the observation that some- 
times scientific concepts emerge from a radical change in theory 


™ Review of the second volume of the Philosopbhisches Magar, see Allison, pp. 


174-75, 
H Even in literature our use of terms is affected by theory. Thus, in “A View 
from Elm Street,” Umberto Eco points out that “The bible obliges its readers to in- 


terpret a whale as 2 big fish; contemporary novels presuppose that whales are to be 
interpreted as mammals. When Melville seeks to play a double peu and to tell of a 
whale that must be considered at the same time as both a fish and a mammal, he 
spends an entire chapter making his readers eager to overlap two different ency- 
clopedias"—Paulo Leonardi and Marco Santambrogio, eds., On Quine: New Essays 
(New York: Cambndge, 1995), pp. 22-36, here p. 24. 
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changed in content but unchanged in identity. Explanation: Put- 
nam observes that some scientific concepts are interwoven in our 
theories in such a way that their content is determined by the cluster 
of laws associated with those theories. (The concept of energy, for 
example, is embedded in a cluster containing the law of conserva- 
tion of energy, laws concerning chemical, gravitational, and nuclear 
energy, laws relating energy to mass and momentum, and so on.) A 
change in some of these laws results in a change in the content of 
those concepts, but since those concepts are connected with a host 
of other laws (which presumably do not all change at once), their 
identity is preserved across time. Whether and in what way a given 
law cluster concept will change its content is an open question, ac- 
cording to Putnam: nothing in the concept itself can protect it 
against change. He restricts his analysis to certain “privileged” con- 
cepts of physical science: “energy,” “light,” “straight line,” and so on. 
But (as Putnam is well aware) there is nothing in his analysis to ex- 
clude other concepts from exhibiting the same behavior. Indeed, it 
follows from the genetic fallacy argument that even concepts origi- 
nally introduced by convention can become interwoven in our theo- 
ries in the way Putnam describes. (He claims that at every stage in 
the development of science some concepts must play the role of 
“fixed points.” This claim granted, it does not follow that the same 
concepts must play this role at every stage, that is, that there are in- 
herently “fixed” concepts.) The view that all concepts are, in princi 
ple, sensitive to change in theory is tantamount to Quine’s thesis of 
the inseparability of language from theory. It follows from this claim 
that no statement is guaranteed to be true by the “fixed” meaning of 
its terms, that is, no statement is analytic in the traditional sense. 

This completes my outline of a defense of NAS: exegetically, 
Quine is strongly committed to NAS; historically, NAS is a revolu- 
tionary thesis of much philosophical interest; and substantively, NAS 
is supported by three epistemic principles: (i) the principle of maxi- 
mizing the maneuverability of our cognitive resources, (ii) the prin- 
ciple of the normative insignificance of the genesis of our theories, 
and (iii), the principle of the nonfixity of concepts. 

B. Second solution: withdraw CP. The motivation for this solution is 
straightforward: If NAS is worth preserving and CP is incompatible 
with NAS, why not eliminate CP? 

ion. We cannot eliminate CP tout court, since some positive 
model of knowledge is needed to validate NAS. Furthermore, CP, in 
its present form, makes two indispensable contributions to Quine’s 
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theory: (1) u gives structure to his holism, (2) it provides a link between 

(1) Structuring holism. Without something like the center-periphery 
duality, Quine’s system of knowledge is a formless body of haphaz- 
ardly interrelated statements. Such an amorphous holism is obvi- 
ously unacceptable. First, it lacks explanatory power: to give an 
explanatory account of knowledge is to impose a meaningful struc- 
ture on its elements. Second, it renders our body of knowledge ut- 
terly unmanageable: a structureless object can be grasped either in 
its entirety, hence in one act of cognition, or not at all; but our sys- 
tem of knowledge is obviously too large, too complex, and too open 
ended to be grasped in one act. Conclusion: a viable holism must 
impose a manageable structure on our body of knowledge; that is, a 
viable holism is not just a structured holism; it is a structured holism 
with a manageable structure.™ 

CP introduces both structure and simplicity into Quine’s holism. 
It postulates a relatively simple structure of knowledge: a field or a 
sphere with two distinguished zones. The one zone— center—is the 
center of interconnectedness, the other zone— penphery— marks the 
outer reaches of the system. From the center, our body of knowl- 
edge stretches outward toward the periphery, moving gradually from 
the universal to the particular, from the general to the specific, from 
the abstract to the concrete, from the conventional to the factual, 
until reaching its outer boundaries. The periphery is linked to the 
center by a network of cognitive routes (routes of deductive and sta- 
tistical inference; routes of abstraction and generalization; routes of 
application and instantiation; routes of revision, explanation, justifi- 
cation; and the like), and it is through this network of roads that in- 
formation, evidence, and other epistemic commodities move across 
the system. 

(2) Anchoring knowledge in reality. By itself, Quine’s holistic concep- 
tion of knowledge with its principles of interconnectedness, univer- 
sal revisability, NAS, pragmatism, and so on is compatible with a 
coherentist or even an idealistic epistemology. Any system of state- 
ments will satisfy these principles, provided certain conditions hav- 
ing to do with the internal structure of the system are satisfied. The 
transformation of Quine’s holistic theory of knowledge into a theory 


=» Dummett’s criticism of NAS in Frege: Philosophy of Language is based on the 
(mistaken) assumption that NAS itself implies a structureless holism. Some of 
Jerry Fodor and Ernest Lepore’s criticisms m Hobsm: A Shopper's Guide (Cambridge: 
Blackwell, 1992) are also directed at a structureless holism 
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of knowledge of reality is accomplished by CP. CP delineates the in- 
ner structure of our system of knowledge in relation to reality. It an- 
chors our corpus of knowledge in reality, on the one hand, and in 
language (the mental), on the other. Our corpus is linked to reality 
through the periphery, and the impact of reality permeates the cor- 
pus due to interconnections between its elements. These intercon- 
nections are regulated in the center, where the influence of 
language, method, mind, pragmatic norms, and so on is most 
strongly felt. Changes may take place anywhere in the sphere, but 
all changes must square with reality (directly or indirectly) through 
the periphery. Even a theory as highly removed from experience as 
Kinstein’s general relativity theory must square with experience 
through the periphery, that is, via crucial experiment. If we call the 
requirement that our system of knowledge be anchored in reality re 
alism (or minimal realism), then a theory of knowledge based on NAS 
requires something like CP to render it (minimally) realistic. 

In making these contributions to Quine’s epistemology, however, 
CP assumes the form of an ultratraditional model: one fixed 
center—the site of logic, mathematics, and philosophy-—~sur- 
rounded by scientific theories whose supporting evidence (in the 
form of experiential reports) is transported from the periphery. We 
may liken this structure to a traditional industrial economy: one me- 
tropolis—the locus of capital, services, means of communication — 
surrounded by industrial sites whose raw materials are transported 
from quarries and mines across the country. The system as a whole 
is interconnected, but the norms governing its center and periphery 
are radically different: the norms governing statements in the center 
have to do with the quality of “general services,” those governing 
statements in the periphery—with the quality of “raw materials” (the 
embeddedness of the system in reality). It is this rigid division of the 
cognitive norms that is irreconcilable with NAS. 

IV THIRD SOLUTION FROM AN ABSOLUTIST-STATIC MODEL TO A 
CONTEXTUAL-DYNAMIC MODEL 

Our analysis suggests that NAS is an innovative thesis of much philo- 
sophical interest, while CP makes an important contribution to 
demonstrating its epistemic viability. Neither thesis should be elimi- 
nated, yet the two stand in a fundamental tension. A natural way of 
resolving the problem is by reconfiguring CP, that is, identifying 
those elements of CP which are responsible for the conflict and re- 
placing them by other elements in agreement with NAS. 

My solution is based on the following observation: there i is no com 
flict between the idea, systematized by CP, that a two 
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of knowledge are language and the world, and the idea, central to 
NAS, that the division between them within a given statement is net 
ther fixed nor lasting. Rather, the root of the conflict is the assump- 
tion, implicit in CP, that each statement, and, by extension, each 
theory, is located within a fixed region of the sphere—within a fixed 
distance from its center and boundary-—~and this distance is deter- 
mined by the ratio of conventional to experiential factors in its truth 
and justification. The presumption of a fixed location involves the 
requirement that a fixed range of experiences and a fixed range of 
conventions be associated with each statement, but exactly this is 
contested by NAS. It is thus not the existence of two distinct sources 
of knowledge, metaphorically referred to as “center” and “periph- 
ery,” that is the source of the conflict; rather, it is the postulation of 
fixed factual conditions and fixed conventional conditions for the 
truth and justification of each statement that is responsible for it. 
The fixity requirement has two dimensions: (i) at any given point 
in the duration of our corpus each statement occupies a fixed 
(unique, absolute) position within the sphere, and (ii) while in the 
course of history a statement may change its exact location (for ex- 
ample, as a result of a re-axiomatization of a theory to which it be- 
longs), its general area as well as its rough distance from the center 
and the periphery remain the same. This is true especially of ob- 
servation statements, which always occupy the periphery, and of 
logico-mathematical statements, which are always in the center. 

My solution to the inner conflict in Quine’s theory lies in replac- 
ing this traditional, static, and absolutist model by a new, dynamic, 
and contextual model. There are still two zones, corresponding to 
the two basic constituents of human knowledge, but no fixed distrib- 
ution of statements and theories within the model. As a theory of 
the formal laws governing structures of objects—for example, the 
laws relating identity to cardinality—logic is in the periphery; as a 
theory of the logical laws governing statements and sets of state- 
ments, logic is in the center. As a theory saying that a structure of all 
identical objects is a structure of one object, logic is about the world; 
as a theory saying that the statement ‘All objects are identical’™ im- 
plies the statement “There is exactly one object’, logic is about lan- 
guage. As a theory of the formal structure of “real” objects 
(configurations of objects), logic is anchored in reality, as a theory of 


» Or, rather: ‘All objects are identical to some object’ 
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the formal structure of our thought of objects, logic is embedded in 
the mental. A similar duality exists in other disciplines. Take zook 
ogy, for example: as a theory of the hierarchy of life, it is in the pe- 
riphery; as a system of concepts of life (as a hierarchy of zoological 
concepts), it is in the center. 

Normatively, the project of knowledge consists of two tasks: the 
task of constructing a serviceable set of conceptual tools, and the 
task of constructing a correct and informative theory of the world. 
Accordingly, our system of knowledge is measured by standards of 
two types: substantive standards—truth, confirmation, prediction, 
explanation; and pragmatic standards—economy, convenience, utik 
ity. In constructing a system of knowledge, we move back and forth 
between the center and the periphery: we set out to improve the 
overall usefulness of our conceptual apparatus, and we set out to ver- 
ify the correctness of our theories. Each theory in our corpus must 
square off with some facet of reality: logic with the forma! behavior 
of objects, physics with their physical behavior. Given a conflict with 
reality, we are given great latitude in devising a solution. We may re- 
solve the conflict by making changes in our physical theory of the 
world, keeping logic “fixed” in the background; or we may resolve it 
by changing our theory of the formal structure of the world, keeping 
the physical laws “fixed” in the background. We may resolve the 
conflict by placing physics in the “front” (periphery), logic in the 
“rear” (center); or we may resolve it by placing logic in the front, 
physics in the center. By holding our notion of physical object fixed, 
we constrain the variability of our logic; by holding the laws of logic 
fixed, we restrict the variability of our physics. Ontology plays a simi 
lar role, as do other branches of philosophy. Philosophy, on this pic- 
ture, is not above (or below) the sciences; rather, philosophy— through 
its theories of logic, object, truth, rationality, and the like—faces the 
court of knowledge both in the center (on the level of concepts and 
method) and in the periphery (on the level of fact), just like other 
sciences. And like other sciences, philosophy has its own identity, 
methods of inquiry, and relative autonomy. (Philosophy is not re- 
ducible to empirical psychology.) 

While Quine’s narrow naturalism is rejected by our model, many of 
his early theses are integrated into it NAS, the interconnectedness of 
knowledge, the universal revisability of beliefs, the inseparability of lan- 
guage from theory, (minimal) realism, scientific empiricism, antifoun- 
dationalism, and certain important features of the center-periphery 
duality. In particular, the new model continues to make the two main 
contributions of the original CP: (i) structuring the interconnections 
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between statements (and between theories), and (ii) linking our system 
of knowledge to reality. Unlike the old model, however, the new 
model is dynamic and contextual: at each stage in the development of 
our corpus, each science lies closer to the periphery in some contexts, 
closer to the center in others, and in the course of history, sometimes 
one constellation of theories, sometimes another, occupies a more 
prominent place in the periphery. 

Methodologically, the new model is both (highly) pragmatist and 
(moderately) metaphysical: pragmatist in the idea that given a con- 
flict, we have great latitude in maneuvering our cognitive re- 
sources; metaphysical in the idea that some conflicts with reality are 
best met by placing logic, philosophy, and mathematics in the pe- 
riphery. Such a combination of metaphysics and pragmatism is, ac- 
cording to Quine, characteristic of NAS: “One effect of 
abandoning [the traditional dogmas] is...a blurring of the sup- 
posed boundary between...metaphysics and natural science. An- 


other effect is a shift toward pragmatism.”” 
V THE NEW CP: AN OUTLINE AND CRITICAL REMARKS 


To clarify the structure, scope, and ramifications of the new CP, both 
as a critical interpretation of Quine’s theory and as a model standing 
on its own, I shall offer a concise outline of the model. The outline 
will center on the two dimensions along which concepts, statements, 
and theories shift their position within the model: the time dimension 
and the context dimension. It is quite clear that the addition of these 
dimensions to the old CP does not detract from its ability to repre- 
sent the Quinean themes of interconnectedness, universal revisabil- 
ity, inseparability of language from theory, pragmatism, and so on. 
The outline will focus, therefore, on the movement of statements 
and theories along the two dimensions, the sense in which logic and 
philosophy lie (in certain respects and at certain times) in the pe- 
riphery, the normative principles (associated with NAS) underlying 
the model, and some of the model’s ramifications for outstanding 
philosophical problems. In drawing the outline, I am aiming at clar- 
ity and, to a certain extent, systematicity, but not at a comprehensive 
or exhaustive account. While the outline expands the exegetical ba- 
sis of my solution by drawing upon Quinean ideas not considered in 
earlier sections, it is important to remember that the model is in- 
tended as a critical interpretation (and development) of Quine’s the- 
ory, and as such it diverges from Quine on certain significant points. 


” °Two Dogmas of Empiricism,” p. 20. 
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The center-periphery metaphor is interpreted as a “family” of duali 
ties: “fact versus convention,” “objects versus concepts,” “front versus 
rear of knowledge,” and so forth. 

Two dimensions of change: the most distinctive characteristic of 
the new CP (compared with the old one) is the “movement” of cog- 
nitive elements along two axes: time and context. Diachronically, the 
new model is dynamic, that is, as our body of knowledge develops, 
the deployment of concepts, statements, and theories within it 
changes in response to changing circumstances. Synchronically, the 
new model is contextual, that is, at each stage in the development of 
our system each statement, concept, and theory falls within a mult- 
plicity of contexts, and viewed from the varied perspectives of these 
contexts, its position within the system changes. In the extreme 
case, statements (concepts, theories) shift their position from the 
center to the periphery and from the periphery to the center. 

A. Contextual change. We can distinguish two (complementary) in- 
gredients of contextual change: (1) multiplicity of perspectives, (2) mul 
tiblicity of factors. 

(1) Multiplicity of perspectives. Change in perspective may lead to 
change in orientation: viewed from one perspective, a concept, 
statement, or theory is world oriented, viewed from another, lan- 
guage or mind oriented; viewed from one perspective, a given sub- 
ject matter is factual, viewed from another, mental or conceptual. I 
shall consider two sources of such change in perspective: (a) the two- 
sidedness of language, (b) the plurality of human interests. 

(a) The two faces of language; semantic ascent and objectual descent. 
One source of the multiplicity of perspectives is the two-sidedness of 
language. It is an old observation (made by medieval philosophers) 
that sentences in general can be read both in the linguistic mode 
and in the factual mode. Take, for example, the sentence “Whales 
are mammals’. Read in the linguistic mode, this sentence says that 
‘whale’ is subsumed under ‘mammal’; read in the factual mode, it 
says that whales possess the characteristic properties of mammals. 
Read in one mode this sentence is language oriented, read in the 
other, world oriented. One important vehicle of such change in ori 
entation is the truth predicate. The truth predicate enables us to 
speak about the world by speaking about language, and to speak (in 
certain ways) about language by speaking about the world. Instead 
of saying that snow is white, we can say that ‘Snow is white’ is true, 
and instead of saying that ‘Snow is white’ is true, we can say that 
snow is white. Quine introduced the term semanuc ascent for one d+ 
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rection of this shift; I shall use odjectual descent for the other direc- 
tion. 

(i) Truth is both language and world oriented. The dual principle of 
semantic ascent-—objectual descent is encapsulated in the truth 
schema. But the truth schema itself, according to Alfred Tarski and 
Quine, allows both a linguistic and a factual reading. Read in the 
linguistic mode, the truth schema is a disquotational schema; read in 
the factual mode, a correspondence schema. Under one reading, 
truth has to do with an intralinguistic relation; under the other, with 
a relation between language and the world. While the two readings 
yield the same T sentences, their explanation of truth is radically dif- 
ferent. The disquotational account of truth is linguistic, the corre- 
spondence account, metaphysical; the disquotational notion of truth 
is language oriented, the metaphysical notion, both language and 
world oriented. 

(ti) Objectual descent: logic ts world oriented. The principle of objec- 
tual descent provides one explanation of logic’s presence in the pe- 
riphery. It follows from this principle that the logical laws, despite 
their appearance (for example, in the case of sentential logic) as 
laws governing the behavior of language, reflect certain regularities 
in the behavior of the world. In Quine’s words: “Logical theory, de- 
spite its heavy dependence on talk of language, is...world-oriented 
rather than language-oriented; and the truth predicate makes it 
so.... [T]he truth predicate is...doing an active job of separating 
logic from language."™ And elsewhere he says: “Consider...the logi- 
cal truth ‘Everything is self-identical’, or ‘(x)(x=x)’. We can say 
that it depends for its truth on traits of the language (specifically on 
the usage of ‘=’), and not on traits of its subject matter; but we can 
also say, alternatively, that it depends on...[a] trait, viz., selfidentity, 
of its subject matter, viz., everything.”™ On the second interpreta- 
tion, the law of self-identity is world oriented. 

(iti) The epistemic priority of the factual: the immanence of knowledge. 
While statements and theories within the model are both fact and 
language oriented, epistemically their factual orientation takes prior- 
ity over their linguistic orientation. The epistemic primacy of the 
factual is tantamount to Quine’s thesis of the mmanence of knowledge. 
This thesis says that we know the world through our theories—that 
is, from a standpoint internal to our theories—but speaking from 


™ Philosophy of Logic, p. 97. 
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within a theory is speaking factually. We use words (in our theories) 
to speak about objects, and in so doing we give priority to their fac- 
tual reading. (We can view this as the principle of “use over men- 
tion”). The epistemic priority of the factual over the linguistic 
means that all knowledge is, in a certain basic sense, fact or peripb- 
ery oriented. This principle is reflected in Quine’s claims that logic 
is “world-oriented rather than language-oriented,” and that mathe- 
matics and ontology are about “what there is.” 

(iv) Philosophy is fact oriented. The immanence of knowledge 
means that all sciences are fact oriented. We can explain the way 
this principle applies to philosophy (even beyond logic and ontok 
ogy) by contrasting the factuality of philosophy with its alleged con- 
ventionality. Consider Kant’s epistemology once again. The 
motivation for his “Copernican revolution” is, as we saw in section 1, 
largely pragmatic: all attempts to establish knowledge based on the 
traditional conception of the relationship between the knower and 
the world have failed; let us see whether by turning this relationship 
around we shall not be able to arrive at a better theory. But the the- 
ory constructed based on this move makes factual claims about its 
subject matter, namely, human knowledge. Speaking in terms of 
aims of theory, we can say that Kant’s theory aims not just at an expe- 
dient account of human knowledge, but also (and indeed primarily) 
at a correct account. Kant’s goal is to identify the real conditions for 
the possibility of knowledge (not some convenient pseudo cond+ 
tions), to give a correct analysis of the basic structure of our cogni 
tion (not a convenient fiction about its structure), and to provide a 
valid proof of the possibility of human knowledge (not the sem- 
blance of a valid proof, that is, a chain of statements that can conve- 
niently be imagined to be a valid proof). Kant aims at a correct 
theory of the structure and possibility of human knowledge, and in 
this sense his theory, and philosophy more generally, is oriented to- 
ward the periphery. This is perhaps the deepest sense in which phi 
losophy lies in the periphery. Philosophy seeks to obtain an 
understanding of knowledge, truth, morality, valid inference, and so 
on as they in fact are: their true factors, true structure, true possibil 
ity, true difficulties—all as they truly or factually are, not as we may 
conveniently wish, or imagine, or postulate them to be. 

Critical note: the factuality of philosophy means not that philoso- 
phy is less pragmatic than other sciences, but rather that the norms 
of truth (accuracy, absence of error, and so on) are as central to phi 
losophy as they are to other sciences. 
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(b) Multiplicity of interests. Another important source of the multi 
plicity of perspectives is the multiplicity of interests. The multiplicity 
of interests leads to the development of multiple theories of the same 
subject matter. Take the subject of correct reasoning, for example. 
By studying correct reasoning from a purely theoretical standpoint, 
we arrive at a “pure” logic; while by studying it from a point of view 
encompassing both the practical and the theoretical, we arrive at a 
(general) theory of rationality. Differences in perspective may lead to 
radical differences in our conception of a given subject matter: by 
thinking of the logical laws as laws of the mteraction of physical prop- 
erties (properties of physical states), quantum logicians have come to 
view them as empirical; by thinking of these laws as governing the ma- 
nipulation of symbols, the formalists came to view them as conven- 
tional. Different perspectives may either agree or disagree with one 
another: the intuitionistic viewpoint on logic competes with the classi 
cal viewpoint, whereas the modal viewpoint leads to its extension. 
Different theories of the same subject matter are located in different 
areas of the field: a psychological theory of reasoning lies closer to 
the periphery (in the traditional sense of site of experiment and ob- 
servation), a “pure” logical theory closer to the center. 

Critical note: not every point of view on every subject matter is vali- 
dated by the model. That is, the model does not represent a 
methodology of “anything goes.” On the contrary: it is a central 
principle of the new model (closely related to its representation of 
NAS) that every theory is subject both to the norms of truth (accu- 
racy, avoidance of error, justification, explanation) and to the norms 
of efficiency (simplicity, unity, generality, fruitfulness, and so on). 
And this means that the standards for the acceptance of a given view- 
point as a basis for a new theory are higher in the new model than in 
either the traditional or the positivist models. We can express this by 
saying that each theory is subject not just to the norms of utility but 
also to the norms of truth, and not just to the norms of truth but also 
to the norms of utility. As a result, most points of view on a given 
subject matter are rejected by the model (or rather by a world theory 
concordant with it) either as conducive to error or as inexpedient, 
unfruitful, and so on. Nevertheless, the principle of a multiplicity of 
viewpoints is affirmed by the model. 

(2) Multiplicity of factors. ‘The contextual mobility and malleability 
of subject matters, statements, and theories within the model is not, 
primarily, a subjective matter, having to do with our personal 
choices, whims, and proclivities, but (to a large extent) an objective 
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principle, having to do with the multifariousness of the world and 
the plurality of factors constraining the knower. 

(a) Multiplicity of factors constraining the knower, knowledge is not a mir- 
ror representation of reality. The view that knowledge is largely shaped 
by factors having to do with the knower is characteristic of many 
epistemologies. Kant, and, in a different way, contemporary psychol- 
ogists, linguists, and cognitive scientists, view mental structures as 
playing a crucial role in determining our theory of the world. The 
logical positivists emphasize the importance of pragmatic factors in 
shaping our knowledge. Following Thomas Kuhn, many philoso- 
phers, historians, and sociologists of science see scientific knowledge 
as largely determined by social, psychological, political, and environ- 
mental factors. The new model accepts the general principle under- 
lying these epistemologies. It follows from this principle that 
knowledge is not a simple picture of reality (words are not mere la- 
bels of objects; sentences and theories are not mirror images of 
“facts”), but numerous factors residing in the knower intervene be- 
tween our world theory and the world. 

Critical note: the existence of factors residing in the knower 
does not conflict with the applicability of the norms of truth. A 
street map is not a mirror image of a city, but relative to certain 
well-defined goals and standards, it is either a correct or an in- 
correct image of it. Likewise knowledge, in the present model, is 
not a mirror representation of reality, but it is (or is required to 
be) a correct representation of it. This inner duality is inherent 
in NAS: our statements and theories, according to NAS, are both 
conventional and factual; both governed by the norms of “sim- 
plicity, familiarity, scope, and fecundity,”* and subject to the 
norms of truth and evidence. This duality extends to ontology: 
the objects of our theories, according to Quine, are both “posits” 
and “what there is”; as “posits” their knowledge conforms to the 
norms of utility, as “what there is,” to the norms of truth. The 
model explains this two-sidedness of knowledge by its double ori- 
gin in the world and the mind. The interaction of “worldly” and 
“buman” factors is not treated as a mystery by the model, but as 
something for theories within it (epistemology, psychology, biol- 
ogy, sociology, and so on) to account for. 

(b) Multiplicity of factors concerning the world; the factuality of mathemat- 
ics, logic, and ontology. The model admits not only multiple factors re- 


* See “Ponts and Reality,” in The Ways of Paradox and Other Essays, pp. 246-54, 
here p. 247. 
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siding in the knower, but also multiple factors residing in the world. 
The multiplicity of worldly factors is related to the diversity of the 
world. Different factors—connected with different parts, features, 
and aspects of the world—contribute to our knowledge, and the 
plurality of these factors partly explains the plurality and diversity of 
our theories. It is commonplace to say that physics investigates a dif- 
ferent “region” of reality from biology; but in the new model this 
view extends to logic, mathematics, and philosophy. Scientists, from 
Copernicus to Einstein, regarded physical reality as exhibiting not 
only material features but also formal (mathematical) structure. 
The factuality and distinctiveness of mathematics are predicated, in 
part, on the reality and distinctiveness of such structures. The view 
that logic is grounded in special features of reality is suggested by 
Quine when he says: “admittedly [logical truth] depends upon none 
of those features of the world that are reflected in the lexical distinc- 
tions; but may it not depend on other features of the world, features 
that our language reflects in its grammatical constructions rather 
than its lexicone™ Ontology, according to Quine, is also concerned 
with certain features of reality, namely, properties of objects of an es- 
pecially broad nature: “The question what there is is a shared con- 
cern of philosophy and most other non-fiction genres.... What 
distinguishes between the ontological philosopher’s concern and 
[the zoologist’s, physicist’s and mathematician’s concerns] is... 
breadth of categories.” * 

Critical note: in considering the claim that logic and mathematics 
are (in certain respects) located in the periphery, it is important to 
note: (i) this claim does not imply that logic and mathematics are 
empirical; (ii) this claim is not tantamount to Platonism. In fact, nei- 
ther extreme empiricism (the view that apriori, all knowledge is re- 
stricted, or reducible, to empirical knowledge) nor Platonism (the 
belief in a “third realm”) is compatible with the model.” 

(c) Multiplicity of factors undertying institutions; reconciling holism and 
compostionality, The principle of multiplicity of factors applies not 
just to the knower and the world but also to the institutions of knowl 
edge. Take the institution of language, for example. The model 
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permits us to affirm both the holistic nature of language (the fact 
that the meaning of any term may, in certain respects and under cer- 
tain conditions, be affected by that of any other term) and its com- 
positionality (the fact that the meaning of complex terms depends, 
in some systematic way, on the meaning and mode of composition of 
simpler terms). The two are allowed to co-exist as two independent 
factors whose complex pattern of interaction is a matter for linguis- 
tics, psychology, sociology, the philosophy of language, and other 
theories within the model to investigate. By combining holism with 
discrete constituents, the model is “organic” in Dummett’s sense. 

(d) Observation: multiplicity of factors as nchness of structure; enhanced ex- 
planatory power. The multiplicity of factors characteristic of the new 
model amounts to richness of structure. The model portrays the 
process of knowledge as an intricate process, involving numerous fac- 
tors and constituents, dynamically interacting with each other. By al- 
lowing rich structures, the model’s ability to deal with complexity and 
diversity is enhanced. In particular, the model is capable of reconcil- 
ing elements which, in a simpler model, would appear to conflict with 
each other-—for example, holism and compositionality, as discussed 
above. Indeed, the difference between the old model and the new 
one can be explained by richness (poverty) of structure. By taking 
into account a relatively small number of factors, the old model delin- 
eates a very simple structure of knowledge: a static and rigid structure 
with a “fixed” center and a “fixed” periphery, stmulating the analytic- 
synthetic division. By adding two new dimensions (two new clusters 
of factors) to the old model, the new model adds movement and 
complexity to its structure, relativizing its center-periphery duality to 
context and time and reconciling it with NAS. 

B. Change tn Time. I shall note three types of diachronic change, 
corresponding to three aspects of the center-periphery duality: (J) 
factual orientation versus conceptual orientation, (2) “froni” versus “rear” of 
knoutedge, (3) variant versus nvanant (“fixed”) constituents of knowledge. 

(1) Change from factual to conceptual orientation (and vice versa). (a) 
Developmental change. At each stage in its development, each (viable) 
science has both factual and conceptual interests, but at different pe- 
riods in its development different interests may play a more, or less, 
dominant role. How to classify each period in the history of a given 


“ My use of ‘compositionality’ here ıs broader than usual. It covers any seman- 
tics which analyzes complex structures by a finite rithm based on their parts, 
including, in addition to Tarskian semantics, Leon Henkin’s functional semantics 
for partiallyordered quandficatons, Hintikka’s game-theoretic semantics for IF 
languages, and others. 
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science is a matter for the historian and philosopher of science to 
determine, but during periods of major conceptual advances the sci 
ence moves closer to the center, during periods of intensified factual 
investigations (either in the sense of experiment and observation, or 
in the sense of theoretical development) it veers toward the periph- 
ery. Logic and mathematics, too, alternate between the factual and 
the conceptual: the introduction of the modern quantifiers by Gott- 
lob Frege, for example, is naturally viewed as a conceptual contribu- 
tion, while Kurt Godel’s discovery of the incompleteness of 
arithmetic is, in an important sense, a factual contribution. All sch 
entific revolutions, according to Kuhn, involve major changes in 
conception and methodology (both practical methodology and the- 
oretical methodology); such changes are represented in the model 
as occurring (largely or partly) in the center. 

(b) Change tn grounds and significance. The same statement or the- 
ory can be accepted based on one ground during one period, based 


on another during another (the genetic fallacy argument); similarly, 
its significance may vary in the course of time. For example, it is 
quite natural to view non-Euclidean geometry as moving from a 
phase (in the nineteenth century) in which its significance was pri 
marily conceptual (independence of the parallel postulate and gen- 
eralization of Euclidean geometry to Reimannian geometry) to a 
phase (in the twentieth century) in which its role and significance 
are largely factual (a theory of the curvature of physical space). 

(2) Movement from “rear” to “front” of knoudedge (and vice versa). (a) 
Movement of whole disciplines. In the battle against nature sometimes 
one discipline, sometimes another, occupies the front. It is common 
to think of the natural and social sciences as occupying this position: 
physics in 1687 and 1905, biology in 1859, psychology in 1900, and 
so on. Traditionally, logic and philosophy are regarded as perma 
nently located in the center, “setting the stage” for the battle against 
nature rather than taking an active part in it. The NAS methodology 
represented by the model challenges this tradition. Speaking of log- 
ic’s role in contemporary physics, David Finkelstein says: “[Logic is 
a] dynamic ingredient in the physical theory, an actor rather than part 
of the stage."™ In the new model, logic plays both the role of an actor 
and that of managing the stage. As an actor, the challenges taken up 
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by logic are of a general and fundamental nature: the challenge of 
valid inference (transmission of truth from sentences to sentences), 
the challenge of bivalent and nonbivalent phenomena, and the like. 
Philosophy, too, is assigned a double role. One of the challenges 
posed by nature to philosophy is the skeptical challenge, which ques 
tions humanity’s ability to break out of its “mental cage” and attain 
“true” knowledge of the world. Philosophy took up the skeptical 
challenge in René Descartes’s Discourse on Method and Meditations, 
Kant’s Critique of Pure Reason, and elsewhere. Quine’s 1968 mani- 
festo, “Epistemology Naturalized,”™ calls for fighting the skeptical 
battle on the psychological rather than the philosophical front. Crit- 
ics of Quine’s naturalism claim the challenge cannot be adequately 
met by psychology. The new model takes no apriori stand in this 
controversy. The choice of strategy is an open question, to be de- 
cided based both on the nature of the challenge and on the avail 
ability of resources. The balance of these factors may vary from era 
to era, and with it the choice of strategy may vary. 

(b) Movement of special issues. Not only whole disciplines but also sin- 
gle questions can shift their position from the rear to the front. Take 
the Enischeidungsproblem, for example. As posed by David Hilbert in 
1900, this problem concerned the existence of a finite procedure for 
determining the solvability of Diophantine equations. But though a 
problem of considerable interest for mathematics, it was far from be- 
ing one of the central issues facing our system of knowledge at the 
turn of the century. In the 1930s and 1940s, however, the Entscha- 
dungsproblem became the focus of a serious challenge (or group of 
challenges) to our system. This challenge—subsumed under the 
headings of completeness, decidability, computability, recursiveness, 
and so on—concerns the scope and limits of a broad class of proce- 
dures and functions of special importance for knowledge. With the 
work of Gddel, Alonzo Church, Alan Turing, Emil Post, John von 
Neumann and others, the Entscheidungsproblem moved to the front of 
knowledge, leading to some remarkable gains in the battle against na- 
ture, as well as to some remarkable losses. 

(3) Change from being a “vanant” to being an “invariant” (“fixed”) con- 
stituent of knowledge (and vice versa). ‘The idea of the center and the 
periphery as representing the “fixed” and “varying” constituents of 
knowledge is a traditional idea: our language, logic, and mathemat- 


“ Ontological Relatrity and Other Essays (New York: Columbia, 1969), pp. 69-90. 
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ics are fixed, experiential knowledge is variable and changing. The 
new model relativizes this duality both to context and to time. The 
underlying principle is Otto Neurath’s: we build our system from 
within, by shifting our foothold from one area within it to another. 
Placing our foothold in area A—that is, holding the concepts, laws, 
and theories of area A fixed—we rebuild area B; shifting our 
foothold to area B (or to an area intersecting with B), we change 
area C; and so on.” The area we hold fixed at time interval ¢ is the 
center at t, the area we change at t, the periphery at t. This 
Archimedean principle is both dynamic and contextual: we can use 
A as a lever for changing B both at a particular time and in a particu- 
lar context. Among the many ramifications of this principle are: 

(i) Explanation of continuity through change. One of the main chal 
lenges facing any epistemology is the explanation of continuity 
through change. The problem is especially difficult for theories per- 
mitting radical change, like Quine’s theory with its universal revis- 
ability and inseparability theses, and Kuhn’s theory of scientific 
revolutions as paradigm shifts. In the current literature, the prob- 
lem is variably referred to as the “incommensurability” problem, the 
“meaning variance” problem, the “change of theory is change of sub- 
ject” problem, and the like. The new model is well equipped for 
handling this problem. Think of the “fixed” area of our system at 
i—the center at t—as containing those elements which are stable at 
t. The center is what “glues” the system together, the periphery is 
the locus of change and revision. During periods of “normal” devel 
opment, logic, methodology, and general scientific principles are 
held fixed in the center; during periods of radical change, it is con- 
ceivable that some combination of simple observation statements, 
everyday concepts, common-sense maxims, and logical and theorett 
cal elements (not undergoing change) constitute the center. The 
exact dynamics of variance and invariance is a research project for 
various disciplines within the model, but the model itself offers a 
flexible framework for the explanation of stability through change.* 


1 While the question of circularity arises in this situation, the applicable circu- 
larity criterion will have to take into account the nonfoundational nature of the 
model. 

* A similar approach to the stability problem is suggested by Mary Hesse in 
“Duhem, Quine and a New Empiricimn,” in Sandra G. Harding, Can Theories Be Re- 
futed? Essays on the Duhem-Qutne Thess (Boston: Reidel, 1976), pp. 184-204, and by 
Gochet, chapter 1, section 6. 
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(i) Foundations without foundahonaksm.* By allowing all sciences to 
occupy the same “structural” positions within the “sphere” (that is, 
center, periphery, and intermediate positions at varying distances 
from the two), the model subjects all sciences to the same (highly 
general) norms. Among these are the norms of critical evaluation 
and substantive justification. If we regard these norms as the founda 
tion norms, then every science within the model— including sciences 
that in traditional models are placed at the bottom of the founda- 
tional pyramid—requires a foundation. Neurath’s principle allows 
the model to provide such a foundation. Each science stands in a 
multitude of normative, conceptual, practical, and theoretical rela 
tionships to other sciences, and by selecting an appropriate stand- 
point among these sciences, we can use their resources to provide it 
with a foundation. While an apriori, universal foundation—that is, a 
foundation guaranteed in advance for all contexts and all tmes— is 
at variance with the model, its norms of unity, generality, and econ- 
omy favor broad foundations over narrow foundations. 

(iti) A standpoint for philosophy (“immanent transcendence”). The view 
that philosophy requires an external standpoint has led philoso- 
phers like Ludwig Wittgenstein and Richard Rorty to deny the possi 
bility of a substantive philosophy. A substantive theory of 
knowledge, for example, requires a viewpoint encompassing our 
world theory in its entirety, but such a viewpoint is not available to 
humanity. The new model allows us to reconcile the immanence of 
knowledge with the transcendence of philosophy. It suggests the 
possibility of a very broad standpoint: an area of knowledge from 
which we can view all other areas, both in their relation to each 
other and in their relation to the world (as seen from the chosen 
area). The chosen area may offer an indirect view of some of its own 
regions, as does arithmetic with respect to its own syntax and certain 
mathematical and scientific theories with regard to their own ontokb 
ogy.” ‘Thus, although an absolute transcendent viewpoint is ruled 
out by the model, a relative transcendent viewpoint—a point of view 
transcending all but a makeshift center—is compatible with it. Such 
a point of view is, indeed, characteristic of the model itself. The 
model offers a critical outlook on our system of knowledge from a 
standpoint within it, namely, that of a late twentieth-century view of 


” This phrase is taken from the title of Stewart Shapiro’s Foundations without 
Foundathonahsm: A Case for Second-order Logic (New York: Oxford, 1991). 

” See Quine, “Ontological Relativity,” in Ontological Relatrvity and Other Essays, 
pp. 26-68. 
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Quine’s early theses and the conflict arising from an attempt to rep- 
resent them by a static, absolutist model. 

This concludes my outline of the new CP. The model resolves 
the inner conflict in Quine’s theory by upholding the principle ofa 
center-periphery structure while denying the assumption of a 
“fixed” distribution of statements and theories within this structure. 
Every statement and theory lies in multiple areas of the structure, 
and as such it is not restricted to the norms of any one area. In par- 
ticular, philosophy is subject both to the norms of simplicity, utility, 
convenience, and economy (the norms associated with the center), 
and to the norms of truth, evidence, justification, explanation (the 
norms associated with the periphery and intermediate sections). As 
a result, philosophy is both conventional and substantive: both a 
practical discipline in the positivist tradition, and a factual, theoretr 
cal discipline in the image of metaphysics and science. 

GILA SHER 
University of California/San Diego 
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CAN A TURING MACHINE KNOW THAT THE GODEL 
SENTENCE IS TRUE?* 


Turing machine can know what it can prove, that is, deduce 
from axioms using well-defined rules of inference. Its ax- 


iomatic database may be large, and its rules of inference may 
be efficient and powerful. But for a Turing machine, knowabilty = 
provalniity. When truth and provability part company, as in the case 
of Gédel’s incompleteness theorem, a machine can cope with one 
but not with the other. 

The short answer to the question in the title is “no,” because if 
Peano arithmetic (PA) should turn out to be inconsistent, the Gödel 
sentence G would be false not true. The truth value of G depends on 
an unknown: the consistency of PA. In any case, truth and provabik 
ity diverge: for if PA is consistent, Gis true and unprovable; and if in- 
consistent, provable and false. On the other hand, conditionalizing 
on PA’s consistency yields ‘if...then’ sentences that are uncondition- 
ally true or false, and here the connection between truth and prov 
ability is more subtle. In section 0, two conditional sentences are 
examined: 

Al: If PA is consistent, then Gis not provable. 

A2: If PA is consistent, then ~Gis not provable. 


Both sentences are true, the difference being that the formal ver- 
sion of Al in the language of PA is a theorem, whereas the formal 
version of A2 to the best of our knowledge is not. Although a “PA 
Turing machine,” programed to run through and check proofs in 
PA, will be able to enumerate all PA theorems, and hence will know 
Al in the sense that it can prove it, the machine, unlike a human be- 
ing, will be incapable of recognizing, grasping, or otherwise perceiv- 
ing the truth of A2. The concept of “unprovable but true” is beyond 
it. Or so I shall argue. 

There have been two recent attempts, by John Lucas and Roger 
Penrose,’ to use Gédel’s theorem to show that any computational de- 
vice is limited, compared to human beings, in what it can know or 
understand. Their arguments, which if correct would imply that the 
program of strong AI was ultimately not viable, have provoked vigor- 


* The author wishes to thank Aml Gupta, Allen Hazen, Zlatan Damnjanovic, 
and Alasdair Urquhart for helpful comments. 

! Lucas, “Minds, Machines and Gédel,” Philosophy, xxxvi (1961). 112-27; Pen 
rose, The Emperor's New Mend (New York: Oxford, 1989), Shadows of the Mind (New 
York: Oxford, 1994). 
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ous rebuttals from other philosophers.” I shall argue along different 
lines. I focus on the sharp distinction between truth and provability 
and show that whether PA is consistent or inconsistent, the set of 
provable sentences differs from the true sentences. I then take up 
an example of a sentence in PA which is true, but which cannot be 
categorized as true by a PA Turing machine. The same holds for 
Turing machines with richer axiomatic bases; for each of them there 
will be at least one proposition which a human can know to be true, 
but which the machine lacks the capacity to recognize. At every 
level, there will be specific cases in which human mental powers out- 
strip the capabilities of Turing machines. 
L GODEL’S THEOREM 

Among the things that Godel showed is that, if PA is consistent, the 
concept of proof is representable in PA. This means that, if Pris the 
set of ordered pairs <x,y>, where x is the Gödel number (GN) of the 
proof of the wellformed formula with GN y then there is an open 
PA formula: 


So EEEO, EE 


which is such that for all xand y, <x,y> € Prif and only if 
E E, Mae 
where xis xin Peano notation, that is, 07” 
This is the definition of weak representability. So-called strong rep- 
a means that, for any x and y7, if <x 7 € Pr then 
Ha J, and if <x,y> ¢ Prthen F ~(_ ). 
Sonate enumerable sets are weakly representable; decidable sets 
are strongly representable. Since it is a routine clerical matter to de- 
termine which pairs <x,y> belong to Pr, Pr s a decidable set and is 
strongly represented by __x y On the other hand, al- 
though the set Th of Gödel numbers of theorems of PA is enumer- 
able because proofs are enumerable, Th is not decidable because the 
nontheorems of PA are not enumerable. The open formula that 


weakly represents Th is (Ax) (x y ). 


e 








? See Paul Benacerraf, “God, the Devil, and Gödel,” The Montst, LI (1967): 9-32; 
David Lewis, “Lucas against Mechanism,” Philosophy, xtIv (1969): 231-33, and “Lu- 
cas against Mechanism II,” Canadian Journal of Philosophy, 1x (1979): 373-76; G.L. 
Bowie, “Lucas’ Number Is Finally Up,” Journal of Philosophical Logic, X1 (1982): 279- 
85; Hilary Putnam, review of Penrose’s Shadows of the Mind, The New York Trmes Book 
Review (November 20, 1994), p. 7; also numerous commentators in Behavioral and 
Bran Sarmas, xt (1990): 655-92. 

* See Moshe Machover, Sat Theory, Logic and Ther Limitations (New York: Cam- 
bridge, 1996), p. 223. 
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Gödel also noted that the set Sub of triples <w,y,z> is decidable, 
where wis the GN of the formula which results from substituting the 
number y (in Peano notation) for all instances of free variables in 
the formula with GN z. Furthermore, w is a function of y and z, so 
that the open formula that represents Sub can be written in the form 
w = sub(y,z). Construct the formula: 


A3: ~( 4x) ( x sub(y,z) ) 
and let the GN of A3 be g. Then the Gödel formula Gis: 


G: ~(3x) (x sub(g, E) — ) 


Since sub(g,g) is the GN of G, G says of itself that it is not a theorem 
of PA. 

Is G provable in PA? Suppose, for reductio, that it were, and let A 
be the GN of its proof. Then <h sub(g g)> e Pr. By representability: 


- 


A4: A sub(g, g) — 
But from tH Gwe derive: 
A5: F (x)(x subg, g) —) 
whence by universal instantiation: 
A6:  ~(___A___sub(g,g)__) 


On the assumption that PA is consistent, A4 and A6 cannot both 


hold, and consequently G cannot be a theorem. 
IL. TRUTH AND PROVABILITY IN PEANO ARITHMETIC 


If G says that Gis not provable in PA, and if Gis in fact not a PA the- 
orem, then it would appear that what G says is true. G would be true- 
but-unprovable. But this does not by itself show that human beings 
can know something—the truth of G—that Turing machines cannot 
know, since both the truth and the unprovability of Gare predicated 
upon the assumption that PA is consistent. If PA should turn out to 
be inconsistent, G would be not true-and-unprovable, but provable- 
and-false. 

There are therefore two alternatives: if PA is consistent, G is un- 
provable and true; if PA is inconsistent, G is provable and false. 
Whichever alternative holds, provability and truth part company. Ei 
ther some truths are not theorems, or some theorems are not true. 
The domain of expertise of a Turing machine lies in the area of 
proof and provability, not in the area of truth. Human beings, on 
the other hand, are acquainted with both proof and truth, and also 
know of cases where the two diverge. 
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It might be argued, despite the above, that, if we conditionalize on 
PA’s consistency, truth and provability come together again. Thus, 
on the assumption that PA is consistent, G is true and unprovable, 
but at the same time the conditional sentence Al is true and, as we 
shall see, provable in PA. Similarly: 


A7: If PA is inconsistent, then Gis provable. 


is also true and provable. If, it is claimed, we focus on conditional 
sentences, truth and provability comcide once more and Turing ma- 
chines can know (=prove) whatever human beings can know 
(=prove). 

The details of this claim must be examined. Denote the Godel 
number (in Peano notation) of a well-formed formula A by [A], de- 
fine the expression Bew[A], which says that A is provable, as 
(dx)(__x«___[A]____), and note that Gis equivalent to ~Bew[G]. 
The assertion that PA is consistent may be symbolized in PA by a 
number of equivalent expressions, such as ~Bew[0 = 0’]. l abbrevi 
ate the latter by “Cons(PA).” Then Al is written in the language of 
PA as: 


AB: Cons(PA) —> ~Bew[G] 
or equivalently as: 


A9: Cons(PA) —> G 


Proving A9 as a theorem of PA is tantamount to proving Godel’s 
second theorem, which states that Cons(PA) is not provable in PA 
unless PA is inconsistent. (If Cons(PA) were a PA theorem, then, by 
A9, G would also be a theorem, which is known not to be the case.) 
In his original 1931 paper, Godel does not prove this second theo- 
rem; a proof is to be found in David Hilbert and Paul Bernays.* As- 
suming that this proof can be fully formalized in PA, AQ is both true 
and provable in PA. 

It follows from the above that in the case of the conditional sen- 
tences A8 and A9, truth and provability coincide, and a Turing ma- 
chine can know the truth of A9 because it can prove A9 as a theorem. 
But a different result holds for the formal version of A2. This is: 


A10: Cons(PA) —> ~Bew [~G] 


* Grundlagen der Mathematik, Volume 2 (Berlin: Springer, 1989). See Stephen 
Cole Kleene, Introduction to Metamathemahcs (A rasteri NorthHolland, 1952), p. 
211; and Elliott Mendelson, Fuiroducton to Mathematcal Logic (Princeton: Van Nos 
trand, 1964), p. 148. 
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Moving from A9 to A10 yields a candidate for the true-but-unprovable 
sentence that we seek. 

Is AlO true? Suppose, for reductio, that PA is consistent and 
that ~G is a theorem. Then [~G] €e Th, and, from the consis- 
tency of PA we infer that [G] ¢ Th. At the same time, since ~G 
is equivalent to (dx)(___ x ), from F ~G we 
derive H (åx) (X [G] ). Because the open formula 
(dx) x J weakly represents Th, it follows that [G] e 
Th, which completes the reductio. Hence A10 is true.’ 

The informal argument just presented cannot be converted into a 
formal proof of A10’s theoremhood, and there are good reasons to 
believe that A10 is in fact unprovable in PA. Because of the equiva- 
lence of ~G to Bew[G], A10 can be written as: 


All: Cons(PA) —> ~Bew[Bew[G]] 














In the logic of provability, one of the laws governing provability as a 
modal notion is: 


Al2: Bew[ X] —> Bew[Bew[X]] 


(“If X is provable, then it is provably provable”).° Instances of A12 
are theorems of PA. Since the converse of A12 does not hold, the 
consequent ~Bew[Bew[G]] of All is a net strengthening of the con- 
sequent ~Bew[G] of A8, and it is unlikely that the long and complex 
proof of A8 in PA could be reworked so as to yield a proof of All. 
Not impossible, perhaps, in the absence of a knock-down indepen- 
dence proof, but highly unlikely. 

The upshot is that a Turing machine programed to run through 
proofs in PA will almost certainly never classify A10 as a theorem 
(unless, of course, PA is inconsistent), but neither will it be capable 
of assigning it to the special category of “unproven but true.” Recog- 
nition of the existence of such a category therefore marks a differ- 
ence in principle between human and machine thinking. 

Despite the above reasoning, there will still be those who wish to 
maintain that somehow, in some fashion, a Turing machine can rec- 
ognize the truth of A10. There are three possible strategies for arriv- 
ing at this conclusion. The first is to show that “truth,” like 


* It is frequently stated that proving the nontheoremhood of ~G requires the as- 
sumption that PA is not only consistent but wconsistent (see, for example, 
Mendelson, p. 148). The argument given in the text shows, however, that the 
weaker assumption of ordinary consistency suffices. 

* See George Boolos, The Logic of Provalthty (New York: Cambndge, 1993), p. 15. 
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“provability,” is in some sense expressible by a formula of PA. The 
second is to formalize the notion of truth by introducing a new prim- 
itive sentential operator J, the truth operator, and show that in PA, 
suitably enriched with extra axioms, 7(A10) and similar theorems 
may be derived. The third is to hold that adding new axioms to PA, 
without the operator T, allows A10 to be proved, and that in stronger 
and stronger extensions of PA, a Turing machine can derive every 
proposition a human can know to be true. I shall argue that all 
these strategies fail. 
I. EXPRESSING TRUTH BY AN ARITHMETICAL FORMULA 
Tarski’s theorem states that the set Tr of Godel numbers of well- 
formed formulas of PA which are true in the standard interpretation 
is not “arithmetical,” that is, there is no open sentence____x of 
PA such that, for any x, x belongs to Tr if and only if the closed sen- 
tence___—*__is true in the standard interpretation.” The notion 
of “truth” in PA is consequently on quite a different footing from 
that of “provability.” The latter concept is represented by an open 
arithmetical formula; no analogous formula expresses (much less 
represents) the former, thus reinforcing the hypothesis that “truth,” 
though meaningful to humans is a closed book to a Turing machine. 
IV. INTRODUCING THE T OPERATOR 

The second possible way of supplementing a Turing machine’s abil 
ity to deal with proof by an ability to deal with truth is to introduce 
an explicit truth operator T. With suitable T axioms, the machine 
might then be able to prove propositions like: 





A13: T(If PA is consistent, then ~Gis not provable) 


For steps in the implementation of this strategy, see, for example, 
Solomon Feferman.’ There are, however, two worries. The first is 
that the T axioms may be able to do no better than supply, for any 
already-proved theorem X, a new theorem 7(X). If X were itself not 
a theorem of PA, as is probably the case with A10, then T(X) would 
also fail to be a theorem. If this were so, introduction of the T oper- 
ator would do nothing to correct a Turing machine’s inability to 
cope with unprovable but true propositions. 

The second worry is more general. T axioms may increase a Tur- 
ing machine’s deductive powers in the sense of giving it more theo- 


’ Alfred Tarski, “The Concept of Truth in Formalized "in Logic, Se 
maniics, Maamathematics (New York: Oxford 1956), pp. 152-278. See Mendelson, 
p. 151; Machover, p. 236. 

* “Reflecting on Incompletenes,” Journal of Symbolic Logic, LVI (1991): 1-49, here 
pp. 13 Œ 


TURING MACHINE AND GODEL SENTENCE 531 


rems it can prove, that is, broadening the extension of “proof.” But 
how is this relevant to “truth”? Take the analogy of non-Euclidean 
geometries. If a Turing machine is programed to accept the Gödel 
numbers of formal axiomatic proofs in Euclidean geometry, and if 
the parallel postulate is then replaced by Lobatchevsky’s postulate 
and the machine is reprogramed, it will generate a different set of 
proofs and a different set of theorems. But will the reprogramed 
machine categorize the old parallel postulate as “true but unprov- 
able"? Or as “formerly provable but false”? This seems unlikely. Be- 


tween provability and truth there yawns an unbridgeable gulf. 
V. STRONGER AND STRONGER EXTENSIONS OF PA 


The third way of getting a Turing machine to accept A10 as true, 
without introducing the operator T, is to strengthen PA and deduce 
A10 from new axioms. 

But again, there is a difficulty. No matter what system L of axioms 
and rules the machine is programed to use, and provided that L is at 
least as strong as PA so that “proof” and “substitution” are repre- 
sentable, there will always be a sentence analogous to A10 which is 
true, but for which the Turing machine will be unable to find a 
proof. In Zermelo-Frankel set theory, for example, an appropriately 
programed machine will be able to construct a Gödel sentence Gy, 
and a ZF-analogue of A10: 


A14: Cons(ZF) —> ~Bew[~G,,] 


The machine will be able neither to prove this sentence, nor to rec- 
ognize it as true. A human being, on the other hand, can know that 
every such sentence, in every extension of PA, is true. 

To sum up, what is the source of a Turing machine’s inability to 
recognize the truth of A10? Can the inability be done away with by 
moving to more powerful programs? Let us start with PA. 

A Turing machine, programed to run through and list all proofs 
valid according to the axioms and rules of PA, cannot recognize Al0 
to be true if it cannot prove it. For such a machine, there are only 
two categories of well-formed formulas: those which are theorems 
and those for which no proof has yet been found. (If PA is discov- 
ered to be inconsistent, the second category will turn out to be 
empty.) For the machine, the category “not yet proven but true” 
does not exist. 

What now of stronger machines, equipped with more powerful ax- 
iomatic bases and able to generate larger sets of theorems? What of 
machines equipped with the truth operator T? What of machines 
with huge data banks? It is difficult to see why the situation, for 
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COMMENTS AND CRITICISM 
DEFLATING THE CONSERVATIVENESS ARGUMENT 


n “Proof and Truth: Through Thick and Thin,” Stewart Shapiro! 
[=== that deflationist theories of truth are not really so defla- 

tionist, in that they require a notion of logical consequence too 
strong to be codified in an effective deductive system. He gives two 
related arguments for this conclusion, and there are hints of a third; 
I think all of them are mistaken. I shall take no stand on whether we 
can understand a noneffective notion of consequence;’ J claim only 
that the sorts of considerations Shapiro brings to bear do nothing to 
show a problem for a deflationist who sticks to first-order logic or 
some other effective logic. 

The term ‘deflationism’ has been used to cover quite a few differ- 
ent approaches to truth, and it is not clear precisely what they 
should be taken to have in common. Fortunately, there will be no 
need for me to be very precise: I shall argue that on no reasonable 
construal of the term do Shapiro’s arguments work. 

Before presenting Shapiro’s arguments, I shall call attention to 
one point to which every theorist of truth should agree: that by hav 
ing a notion of truth we increase our expressive power in an impor 
tant way. Indeed, this is a point that deflationists (or those who call 
themselves that) like to stress: the main point of having the notion of 
truth, many deflationists say, is that it allows us to make fertile gener- 
alizations we could not otherwise make; where by a fertile generaliza- 
tion I mean one that has an impact on claims not involving the notion 
of truth. If we disagree with everything we have ever heard the Pope 
say, we might conclude on inductive grounds that nothing the Pope 
says is true; this has the effect of committing us against not only what 
we have hedrd him say but also against what he has said which we 
have not heard. Since we do not know exactly what these unheard 
remarks are, there seems to be no way to disagree with all his re- 
marks except by use of the notion of truth (or some other device to 
the same effect). That is the main point of the notion of truth, ac- 


! This JOURNAL, XCV, 10 (October 1998). 493-521. 
* My own view is that we probably can understand the noneffective notion of conse- 


Undecidable Sentences Have Determinate Truth Values” in H. Garth Dales and Gian- 
luigi Oliveri, eds., Trith tx Mathematics (New York: Oxford, 1998), pp. 291-810. 
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cording to many deflationists: it allows us to make commitments 
(about matters not involving truth) that we could not make without 
it. Shapiro says that deflationists hold that truth is “metaphysically 
thin.” I am not sure what this means,’ but one thing that it better not 
mean is that we cannot use it to express important things inexpress- 
ible or not easily expressible without it, or that we cannot use it to 
make commitments about matters not involving truth beyond those 
commitments which we could make without it; for it is a clear part of 
deflationist doctrine that truth is not metaphysically thin in that 
sense. (We might put this by saying that everyone, deflationist or 
not, must agree that truth is not expresstusly thin.*) 

Shapiro’s arguments all turn on the well-known fact that in the 
framework of first-order logic, if you start out with a theory adequate 
to the elementary arithmetic of natural numbers (whether a theory 
of pure arithmetic or a more inclusive theory, say, one that postu- 
lates classes as well as numbers), and you add a truth predicate to 
the theory in a certain natural way, then the resulting theory T* is 
not a conservative extension over the theory T from which one 
started.’ To be slightly more precise, let us assume that we adopt a 
standard Gödel numbering of T, and that in T we define denotation 
relative to that Godel numbering in the usual way. We now assume: 


(i) that for each of the finitely many atomic predicates p of T, T™ licenses 
the usual general rule about how the truth of any sentence of form p(t, 
...,t,) depends on the denotations of 4,..,4, and that for each method 
of composing sentences out of simpler sentences, T* licenses the usual 
general rule about how the truth of the compound depends on the 


* I do not mean to blame Shapiro for the unclarity of the phrase: he might res- 
sonably say that the point of calling a theory of truth ‘deflationist’ Is to say that truth 
is in some sense “metaphysically thin,” and that it is up to the deflanonist to specify 
the sense. He might also reasonably say that many people who call themselves de- 
flationist have been less than clear about the kind of “thinness” they are attributing. 
I have no disagreement with this; but it is important not to attribute a sense of thin 
ness that deflationists explicitly disavow. (In my own deflationist moods, I would say 
that the most fundamental notion of truth wa logical notion applicable only 
to sentences we understand, and serving solely as a device of generalization; all 
other notions of truth—for example, those applicable to other languages—are to 
be explained in terms of this, plus relatively modest auxiliary notions.) 

* I concede that one deflationist, Paul Horwich—- Truth (Cambridge: Blackwell, 
1990)—has formulated his theory of truth in terms of principles that are not pow 
erful enough to give ‘true’ the expressive role that we want it to have. He has been 
correctly criticized for this by Anil Gupta, “A Crinque of Defladonism,” Philosoph- 
cal Tofacs, XXI (1998): 57-81. But Shapiro’s criticism is not just of Horwich, and is 
rather different 

* The truth predicate need only apply to anthmetc sentences. If it does, and we 
take the language of T to include numerals, we can smplify the truth theory by 
avoiding the notion of satisfaction. I tacitly suppose this done in what follows. 
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truth of the components. (For these purposes, take a sentence whose 
main connective is a quantifier over the natural numbers as “composed 
out of” the infinitely many sentences resulting from dropping the quan 
tifier and substituting a numeral for the quantified variable.) 


Also we assume: 


(ii) that T* allows mathematical induction on formulas containing the 
truth predicate. (Twill, of course, allow inductions on all formulas 
in the old language not containing the truth predicate; but extend- 
ing induction to formulas containing the truth predicate is an im- 
portant addition.) 


The well-known fact (a trivial consequence of Gédel’s second incom- 
pleteness theorem) is that a theory T* that extends Tin this way can- 
not be a conservative extension of T, if T is consistent: T* implies 
sentences in the language of T that T itself does not imply.® Shapiro 
claims that this fact makes trouble for a deflationist who wants to 
stick with first-order logic: supposedly, it forces the deflationist to 
adopt an expanded logic in which adding a truth predicate to our 
theories in the manner just outlined yields no new consequences in 
the old language. (An expanded logic with this feature must be very 
powerful: powerful enough to resist effective codification.) 

As Shapiro concedes, it is easy enough to add a truth predicate to 
number theory in a more limited way and get a conservative extension of 
what we started with; indeed, there are several ways to do so. He men 
tions a way that does not even allow the generalizations in (i), but only 
allows the Tarski biconditionals (which are derivable from (i) but do 
not entail any generalizations); call the resulting theory TL. But it is 
more interesting to add truth in a way that includes the general laws in 
(i), since I think it is clear that without such general laws the truth 
predicate would not serve its main purpose. Charles Parsons’ has 


* The idea is simply that (i) and (ai) in conjunction with T allow us to argue that 
all of the axioms of T are true, and that the rules of inference of T preserve truth, 
and therefore that all the theorems of Tare true. Also by (1) in conjunction with T, 
‘Om 1’ ig not true; so it is not a theorem of T. That is a sketch of how to derive in T* 
that ‘O™1" is not a theorem of T. But by the incompleteness theorem, we cannot 
derive this in Titeelf (if Tis consstent), so 7* does not conservatively extend T. 

’ For a proof, see hu Mathematics en Philosophy (Ithaca: Cornell, 1983), pp. 213- 
15. The idea is that only predicative sets of natural numbers (and the obvious 
principles governing them) are needed to turn Tarski’s inductrve clauses for truth 
into an explicit definition from which the generalizations in T, follow; but such a 
predicative extension of T is easily seen to be a conservative extension, so T, must 
be. (Even if you expand the induction schema in the original language into the 
claim that every set that includes 0 and 1s closed under successor contains all num 
bers, you still do not get the instances of the schema in the new language, absent 
umnpredicative assertions of set existence.) 
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noted that, if in adding ‘true’ we just add the general laws in (i), the 
extension (call it T,) is perfectly conservative (even in first-order 
logic); it is only when in addition one allows induction on predicates 
containing ‘true’ that we get new first-order consequences in the old 
language. Since truth can be added in ways that produce a conserva- 
tive extension (even in first-order logic), there is no need to disagree 
with Shapiro when he says “conservativeness is essential to deflation- 
ism” (497). Shapiro’s position, however, is that a deflationist must 
hold that adding ‘true’ to number theory in the full-blooded way that im- 
volves (ti) as well as (i) produces a conservative extension. Call this the 
conservative extension assumption. Since this assumption can only be so 
in a noneffective expansion of first-order logic," the crucial question is: 
Is there any good reason for a deflationist to accept the conservative 
extension assumption? There are several distinguishable lines of argu- 
ment for it in Shapiro’s paper. 

Shapiro’s simplest argument for the conservative extension as- 
sumption is as follows: 


If truth [yields new consequences in the old language when it] is admit- 
ted into the induction principle, it is counterintuitive to maintain that 
the notion is thin and insubstantial. How thin can the notion of arith- 
metic truth be, if by invoking it we can learn more about the natural 
numbers (499)? 


But the way in which we “learn more about the natural numbers 
by invoking truth” is that in having that notion we can rigorously 
formulate a more powerful arithmetic theory than we could rigor- 
ously formulate before. There is nothing very special about truth 
here: using any other notion not expressible in the original lan- 
guage we can get new instances of induction, and in many cases 
these lead to nonconservative extensions. So the question be- 
comes: 


How thin can the notion of arithmetic truth be, if by invoking it we can 
rigorously formulate a more powerful theory about the natural num- 
bers? 


But as I noted at the start of this paper, it is quite uncontroversial 
that the notion of truth can be used to make generalizations that 
cannot be made without it, and that these generalizations can be im- 
portant in giving rise to commitments not involving the notion of 
truth. Not only is this generally recognized, it is often cited by defla- 


* Here and elsewhere, I implicitly asume that contraction of first-order logic is 
not an option. 
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tionists as the main point of the notion of truth. Given this, it is 
hard to see any force in the question.’ 

Shapiro’s main argument is different, and cannot be dismissed 
merely by pointing out that no deflationist believes in “expressive 
thinness.” That main argument (497-99) has two components. Let 
B be a theory about a nonlogical subject matter, and let B’ be the 
result of adding ‘true’ to the language and extending B by adding 
new axioms that are (in Shapiro’s phrase) “essential to truth.” 
Then if truth is deflationary, B’ should be a conservative extension 
of B: for B’ is the union of B plus a truth theory TR; if it fails con- 
servatively to extend B, then there is a sentence ® in the truth-free 
language such that BUTR entails ® but B alone does not. In other 
words, the set of sentences BU{—®} is logically consistent and does 
not contain ‘true’, but TR is incompatible with it. Indeed, by com- 
pactness, TR must entail a ‘true’-free sentence of form B,&...&B,5® 
that is not logically true (where the Bs are in B}. But surely (this 
part of the argument concludes), we should not call a notion of 
truth TR deflationary if a theory that involves only axioms that are 
“essential to it” has nontrivial (not logically true) consequences 
about matters not involving truth. 

We can apply this to the case where B is a theory T adequate to 
number theory. Let us concede that not only are the Tarski bicon- 
ditionals “essential to truth,” so too are the general laws (i) used in 
so many applications of the notion of truth. If we concede only 
that, nothing of great interest follows: we already knew that the the- 
ory T, obtained by adding ‘true’ only with these laws is a conserva- 
tive extension of T. The potentially interesting result comes from 
assuming that the induction axioms involving true are also “essen- 
tial to truth”: since the result T* of adding these to T, is not a con- 
servative extension of Tif the logic is effectively codifiable, we get: 


(1) If the new induction axioms / involving ‘true’ are essential to truth, 
and logic is effectively codifiable, then the notion of truth is sub- 
stantial (not deflationary). 


* I might also remark that since deflationists invariably agree that there can be 
fertile generalizations that involve the notion of truth, there 1s no obvious reason 
why a deflationist cannot allow such generalizations to be used in explanations. I 
think that any view reasonably called ‘deflahonist’ must pose limits on the ways in 
which the notion of truth (and of truth conditions) is put to use in explanations, 
but I would have thought that any use of ‘true’ in explanations which derives solely 
from its role as a device of generalizanon should be perfectly acceptable. Shapiro 
makes a big deal of the fact that the notion of truth can be used ın explanations by 
virtue of its generalizing role (505-07), but this fact seems to me quite unsurprising 
and no threat to any reasonable version of deflationism. 
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That is the first component of Shapiro’s main argument. 

The second component of Shapiro’s main argument emphasizes 
the importance of the induction axioms: they are important, for in- 
stance, because we certainly want to be able to prove inductively that 
all theorems of T are true on the basis of the truth of the axioms and 
the truth preservation of the rules of inference, and the axioms Jare 
obviously needed for such a derivation.” So 


(2) The new induction axioms are essential to truth. 


From (1) and (2) it follows that, if logic is effectively codifiable, then 
truth is substantial. 

But I think it is clear that this argument rests on an equivocation 
with the phrase ‘essential to truth’. What the argument for (2) es- 
tablishes is: 


(2') The induction axioms are needed if we are to arithmetically derive 
important facts that involve the notion of truth. 


But the argument for (1) works only if its first antecedent is inter- 
preted as meaning: 


(3) The truth of the induction axioms depends only on the nature of 
truth. 


(If the truth of these axioms depend on facts about numbers as well, 
it is hardly surprising that number-theoretic conclusions should fol- 
low from them in conjunction with the purely truth-theoretic axioms 
T1) But (3) clearly does not follow from (2’). Moreover, (3) just 
seems false: the corresponding axioms would hold for any other 
predicate, and what they depend on is a fact about the natural num- 
bers, namely, that they are linearly ordered with each element hav 
ing Only finitely many predecessors. 

I have discussed the two most explicit arguments in Shapiro’s pa- 
per. But there are two points which Shapiro emphasizes on which I 
have put no weight, perbaps they are the basis for a third argument. 
The first point is that mathematical theories adequate to number 
theory usually are formulated in terms of axiom schemas (that is, it is 


a ERDE cd, E A ast t also hope to establish that all theo- | 
rems of T” are true, there is no hope of doing this in T*: for T* does not have 
the means for attributing or denying truth to the axioms about truth in 7*. 
More generally, Gédel’s second incompleteness theorem implhes that (in a lan- 
guage adequate to number theory) no consistent effective theory of truth that 
allows us to pass freely between the attribution of truth to a sentence and the 
sentence itself can have the means for inductively arguing that all its theorems 
are true. 
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laid down that all sentences of a certain form are axioms), and on a 
typical such formulation, the additional induction axioms / result 
simply from extending the range of instances of the schema to the 
larger language that includes ‘true’.” The second point concerns 
what is normally involved in commitment to a schema, at least in 
mathematics: the point is that people who commit themselves to the 
arithmetic induction schema mean to be committing themselves not 
only to its instances in the language they have but to all instances in 
any legitimate expansion, including one with a truth predicate; and 
similarly for other mathematical schemas, such as schemas of set ex- 
istence.” 

I fully concede both points, but they lend no credence to the 
conservative extension assumption, nor do they in any other way 


This is not so for all mathematcal theones adequate to arithmetic. For in- 
stance, consider a theory of natural numbers and predicative classes of them. This 
can be formulated nonschematically, with only finitely anoms, by formulating io- 
duction as a angle axiom using classes and axiomatizing the classes nonschemati- 
cally in the manner of Gddel-Bernays set theory. Adding ‘true’ to such a theory 
requires new class-existence assumptions or new instances of induction that obvi 
ously are not obtained by extending an old axiom schema. And that certainly sug- 
gests that the extension of schemas is not central to the topic of the 
conservativeness of the truth predicate: a conclusion soon to be argued on inde- 
pendent grounds. 

= This second point need not be so for schematic theones outside mathemat- 
ics. Shapiro notes the possibility of simulating anthmetic within physical theory: 
perhaps by singling out a specific physical object to correspond to 0, and defin- 
ing a predicate N coreapandine to ‘number’) true only of certain physical ob- 
jects located on a ray emanating from this 0, and certain operations on the 
Mobjects corresponding to successor, addition, and multiplication. To adopt the 
analogues of the first-order arithmetic axioms is to adopt a certain theory of the 
Nobjecta: one which is committed to the set of Mobjects being infinite ın extent, 
and which is also committed to induction holding on Mobjects, for predicates in 
the language of the theory. Suppose that N is so defined that the successor of an 
Mobject is always at least one mile from that object; then, if we assume that space 
is Archimedean (no two objects are infinitely far from each other), inducton 
must also hold for predicates not expresmble in the language. But we can imag 
ine the possibility of adopting a cosmology on which space is non-Archimedean; 
or at least, of leaving the possibility of such a cosmology open. In that case, it 
would not be out of the question to accept induction on Mobjects for predicates 
in the original language, but reject or remain noncommital about accepting it 
for predicates in the expanded language. (For N defined in a way that imposes 
no distance requirement on the successor of an object, we might be able to get 
an example that did not require our theorist to take non-Archimedean space se- 
nously.) 

The analogous attitude for natural numbers is, of course, absurd: ıt is part of the 
idea of natural numbers that they are Archimedean, that is, ıt is part of the idea 
that induction holds on any predicate of numbers we could ever come up with. it 
is something about our idea of natural numbers that makes it absurd to suppose that in 
duction on the natural numbers might fal in a language expanded to include new 
predicates (whether truth predicates or predicates of any other kind): nothing 
about truth is involved 
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make trouble for a deflationist who wants to stick to first-order 
logic. In the first place, it is hard to see how the fact that (in 
mathematics at least) we are committed in advance to extend our 
schemas to new predicates could possibly show that the theories so 
extended conservatively extend the theories prior to the exten- 
sion. In the second place, this point about the extension of 
schemas has nothing specially to do with truth: we are committed 
in advance to extending our schemas to all new predicates, not 
just ‘true’. 

Late in the article (511) Shapiro seems to suggest that the fact that 
we are committed in advance to extending induction to any new 
predicates we introduce shows that our understanding of the natural 
numbers exceeds the bounds of first-order logic. I think this is a 
mistake, but shall not discuss it here. There is no need to, because if 
it is correct it gives reason to go beyond first-order logic which is 
quite independent of deflationary theories of truth. The commit- 
ment to extend induction to newly introduced predicates has noth- 
ing special to do with truth: it applies to all predicates, not just truth. 
So it is hard to see how one’s theory of truth could have any special 
relevance to it.” 

HARTRY FIELD 
New York University 


D Since it ıs only the expansion of the schema to new instances involving 
‘true’ that Shapiro makes use of, we might be able to avoid all issues of indefinite 
extensibility by imagining that we start out by accepting a schema of induction m 
a language that contains tis own truth predicate This would not actually avoid the is- 
sue of indefinite extensibility if, like Tarski, one takes the position that there uw 
an infinite hierarchy of truth predicates for a given language. for then adding 
any one yields a nonconservative extension of the original language. And 
Shapiro does seem to hold to the Tarskian view. But there is a respectable alter- 
native position: that we have a single truth predicate for our language, applicable 
to sentences that contain it as well as sentences that do not. Of course, we know 
from Tarski that the truth theory m such a language cannot be fully classical, in 
the sense that not all instances of the Tarski schema ‘“p” is true if and only if p' 
can be accepted. (More accurately, a nonclassical alternative must either restrict 
the Tarski schema or restrict clasncal logic; I shall not consider the latter poasibility 
here.) For an exploration of an important family of nonclassical alternatives 
(that keep classical logic), see Harvey Friedman and Michael Sheard, “An Ax- 
jomatic Approach to Self-referental Truth,” Annals of Purse and A Logic, 
xxxm (1987): 1-21. And for a discussion of how this connects up to the point of 
footnote 10, see the last section of my “Which Undecidable Sentences Have De- 
terminate Truth Values?” 
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COMMENTS ON SHAPIRO* 


et us say I am committed to deflationist truth: I think aug- 

menting a language with a truth predicate, Alfred Tarski’s per- 

haps, or some substitutional variant, does not saddle me with 
new metaphysical commitments: “truth is not a metaphysically sub- 
stantial property, and it has no nature,”' J am prone to say (when- 
ever the topic of truth comes up). 

Stewart Shapiro challenges the idea that this sits comfortably with 
“the logician’s use of truth as a sign of expressive and proof-theoretic 
strength,” and hopes to show the only way for the deflationist to re- 
solve this tension is to embrace a “very strong, noneffective notion of 
logical consequence” (494-95). Since the aim is to show the defla 
tionist is forced to a noneffective consequence relation, let us keep 
our eyes firmly on Shapiro’s target, the deflationist commited to the 
first-order consequence relation, what I shall call the first-order defia- 
honist. 

Shapiro’s ingenious argument is quite intricate, but we need only 
concern ourselves with the considerations that he takes to give rise 
to the tension with which the first-order deflationist must deal. As 
with Shapiro, let £ be a language whose syntax is effective, and sup- 
pose £ can express a certain amount of arithmetic. The variables of 
£ range over the natural numbers, and we introduce a Gödel num- 
bering: | ® | which assigns every natural number a sentence.’ A is an 
effective and sound theory in £ and contains the rudimentary ax- 
ioms of arithmetic. PRF(x, y) is the proof predicate for A definable 
in Ẹ, and let G= ~3yPRE(|G a the fixed point for ~JyPRE (x, y). 

Shapiro concedes that a new system A’ containing a truth predi- 
cate (following Tarski) can be constructed, one which is capable of 
proving every I-sentence and is conservative with respect to A. 
Shapiro claims, however, that such a theory does not contain every- 
thing “essential” to “arithmetic truth” (499). What is required of “an 
adequate theory of truth” is that it have “the resources to establish” 
that the axioms of A are true and that the rules of inference preserve 
truth. In short, it should be able to establish: 


(*) Wx(GyPRF (x, 9) 9 712) 


* My thanks to Jonathan Kastin, Arnold Koslow, and George Smith for com- 
ments 

! Stewart Shapiro, “Proof and Truth’ Through Thick and Thin,” this JOURNAL, 
xcv, 10 (October 1998): 493-521, here p 494. 

* Shapiro chooses a Gödel numbering onto the natural numbers, and I follow suit 
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Furthermore, it should also be able to establish —0=1, and thus, 
via a [-sentence, “7(0=1 1). But under such circumstances, it can 
establish: 


—oyPRF((0=1 |, y) 


which says that A is consistent, and therefore is not provable in A (by 
Godel’s second incompleteness theorem). So this additional “essen 
tial” element of the theory of truth yields a theory of truth for A 
which is not conservative (with respect to A). 

Something slippery just happened here. Although the deflationist 
agrees that “generalizations” concerning truth are “essential to” a the- 
ory of truth (and by this, she means, for example, clauses such as, 
| ® ] is true and |] is true if and only iff ® & Ylis true), she does not 
think it “essential to” a theory of truth to establish generalizations 
about truths. What 1s true (and how) is not, properly speaking, part of 
the theory of truth. In particular, although the deflationist is perfectly 
willing to describe —0=] and (*) as perfectly obvious, she still has no 
reason whatsoever to regard them as part of the theory of truth. Call 
physical truth the truths in some physical language £, that follow from 
some physical theory Åp: surely, it is not required of the deflationist 
that when she supplements £p with a theory of truth that it should 
follow from her theory of truth that everything that follows from Ap, is 
true. And if she does introduce the theory in such a way that this can 
be established from her theory of truth, why should she expect the re- 
sulting theory to be conservative (and, anyway, why would she call 
such a theory thus augmented a “deflationist” one)? 

The point is a simple one: the capacity to establish (nonlogicall) 
truths and generalizations about such goes quite beyond what a first- 
order deflationist calls a deflationist theory of truth—and this re- 
gardless of how obvious such truths and generalizations (about 
them) are. The first-order deflationist can even grant that in the 
case of a theory of truth for arithmetic it is desirable to introduce re- 
sources for characterizing truths that go beyond deflationist truth ((*), 
for example): in doing so, of course, she notes that the resulting 
bundle is not deflationist truth and so one should hardly be surprised 
at its nonconservativeness. 

The second argument Shapiro offers that the first-order defla- 
tionist is involved in a tension involves the induction scheme for 
arithmetic. Given “something like” Tarskian satisfaction in A’, if 
we allow instances containing the truth predicate T in the induc- 
tion scheme, then the resulting truth theory is not conservative 
(499). 
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The first-order deflationist, in light of what has been said above, 
should simply deny that allowing instances containing T in the in- 
duction schema results in a deflationary truth theory and, of course, 
then there is no surprise at the resulting theory failing to be conserv- 
ative. But Shapiro invokes Michael Dummett’ (and Shaughan 
Lavine) as authorities against this move, because “it is part of the 
concept of natural number, as we now understand it, that induction 
with respect to any well-defined property is a ground for asserting all 
natural numbers to have that property” (500; italics added by 
Shapiro). What follows is an argument that, if someone admits that 
a predicate P (however defined) “has a determinate extension over 
the natural numbers (or whatever the numerals denote),” then he 
must allow that predicate to appear in instances of the induction 
scheme (501). 

Unfortunately, for this line of thought, these intuitive considera 
tions apply ony to the standard model of arithmetic, and only to con- 
cepts of natural number so restricted: it is simply not true of other 
models of A that any predicate P (however defined) with a determi 
nate extension over the numbers of that model belongs in the induc- 
tion scheme. Indeed, what allows the existence of nonstandard 
models for any first-order theory containing arithmetic in the first 
place is the fact that although the induction schema holds of any 
predicate definable in the language of that theory (and so the induc- 
tion schema restricted to predicates of that language holds in all 
models of that theory) this is not true of every definable predicate in 
that same language enriched with a truth predicate; not all of the 
models of the restricted induction schema are models of the induc- 
tion schema relative to the new language. This is why allowing the 
truth predicate in the induction schema is not conservative. The 
first-order deflationist (especially that deflationist who has conceded 
to Shapiro that “metaphysical robustness lines up with logical conse- 
quence” (498)*) will therefore deny that A picks out the standard 
model to the exclusion of the other models in which A holds (and 


* “Reply to Wright,” in Bran McGuinness and Gianluig Oliveri, eds , The Phaloso- 
phy of Micha Dummett (Dordrecht Kluwer, 1994), pp. 329-38, cited in Shapiro. 

* The point of ‘especially’ is that a deflationist need not even concede that meta 
physcal robustness line up with logical consequence. She may be willing, for ex- 
ample, to tle together an effectively characterized sequence of iat lars ci 
containing truth predicates for the theories earlier in the sequence, and describe 
all such theories (stipulatively) as about the same subject matter even though they 
are not conservative with respect to each other. Of course, such a set of (first-or- 
der) theories no more “grasps” the standard model than any first-order theory 
does. 
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therefore will deny Shapiro’s claim that, “The idea is that the notion 
of arithmetic truth is somehow implicit in the arithmetic concepts 
[and] [t]his much is congenial with the main, but vague deflationist 
theme that the truth predicate T refers to an insubstantial property” 
(501). Arithmetic truth here refers, as before, to truth in the stan- 
dard model, and the first-order deflationist will deny that A (or any 
effective axiomatization of arithmetic for that matter) somehow (!) 
implicitly grasps truth in the standard model except in the mere 
sense that the standard model is implicitly grasped among other 
nonstandard models which are also implicitly grasped. Shapiro’s ar- 
gument can trap the first-order deflationist into conceding a nonef- 
fective consequence relation only if she already has implicitly 
adopted the idea that somehow a first-order axiomatization itself 
noneffectively (but implicitly) picks out the standard model. 

I submit, therefore, that Shapiro has failed to show any tension 
that a first-order deflationist need feel between a commitment to de- 
flationist truth as described above, and the fact that—2in certain con- 
texts—a Tarskian truth predicate is not conservative. 

JODY AZZOUNI 
Tufts University 
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MEANING AND DEFLATIONARY TRUTH* 


“Charity...is not puffed up”—-I Corinthians, 13, 4. 


would like to discuss Donald Davidson’s' reasons for rejecting 
deflationary views of truth, while being particularly concerned 
with his claim that deflationism must be rejected because the 
concept of truth plays an essential explanatory role in the theory of 
meaning. Accordingly, my discussion will bear on the more general 
question as to what deflationists about truth can or should say about 
meaning. 
I INTRODUCTION 
Davidson agrees with the deflationists in opposing all traditional the- 
ories of truth? He denies, however, that this rejection requires re- 
placing accounts of truth in terms of “correspondence” or 
“coherence” with something more stripped down. The deep mis- 
take, which deflationists share with traditionalists, is to define truth 
at all. We can shed light on the concept of truth by tracing its rela- 
tions with other important concepts: notably, belief and meaning. 
But we cannot define it. At the end of the day, we have to accept the 
concept of truth as an exploratory primitive. 
This hostility to deflationary accounts of truth ought to strike us 
as puzzling. Davidson's views about meaning owe large and gener- 


* I want to thank Paul Horwich and Hartry Field for numerous extremely help- 
ful discussions of the issues dealt with in this paper 

| “The Structure and Content of Truth,” this JOURNAL, LXXXVI, 6 (June 1990): 
281-328, “The Folly of Trying to Define Truth,” this JOURNAL, xc, 6 (June 1996): 
263-78. 

! For this reason, Richard Rorty bas tried to recruit Davidson for the deflationist 
camp. See Rorty's “Pragmatism, Davidson, and Truth,” ın Ernest Lepore, ed., 
Truth and Interpretation (New York: Blackwell, 1986), pp. 383-68 He does not dis- 
tinguish clearly, however, between two ways in which there might not be “much to 
say about truth”. adopting a deflationary account and taking truth as a primitive. 
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ously acknowledged debts to W. V. Quine, who is a deflationist with 
respect to truth. To be sure, Davidson’s views diverge from 
Quine’s in important respects. But it is far from obvious that the 
differences mandate a wholly distinct attitude to truth. In my view, 


they do not. 
I. TRUTH: FUNCTION AND MECHANISM 


Let us glance at some current deflationary approaches to truth, start- 
ing with Quine’s’ disquotational theory. According to Quine, the 
meaning of ‘true’ is fixed by certain logical equivalences. Thus: 


‘All that glisters 1s not gold’ is true if and only if all that glisters is not 
gold; ‘France is hexagonal’ is true if and only if France is hexagonal... 
and so on. 


Appending ‘is true’ to a quoted sentence is just like canceling the 
quotation marks (“disquotation”). For this approach, the core use of 
‘true’ involves cases where a sentence is referred to by its quotation- 
mark name. But given additional empirical information—for exam- 
ple, that a certain sentence expresses Fermat’s Last Theorem—we can 
easily ascribe truth to sentences picked out in other ways. 

The disquotational theory treats ‘true’ as a predicate of sentences. 
But similar accounts of the behavior of the truth predicate can be 
given, mutatis mutandis, if we prefer to think of ‘true’ as predicated 
of propositions. The axioms of such a theory will be: 


The proposition that all that glisters is not gold is true if and only if all 
that glisters is not gold...and so on. 


Paul Horwich’s* “minimal theory” takes this form. 

A recent variant of disquotationalism is Robert Brandom’s® ver- 
sion of the “prosentential” account of truth. According to Bran- 
dom, we should see ‘...is true’ not as a predicative expression but as a 
sui generis prosentenceforming operator. Prosentences stand in for 
sentences in the way that pronouns stand in for referring expres- 
sions. Thus, appending ‘...is true’ to a sentence nominalization 
yields a prosentence which, as a whole, inherits its content anaphori- 
cally from whatever sentence tokening its contained sentence nomb 
nalization picks out (ibid., pp. 299-305). 


* Pursu of Truth (Cambridge: Harvard, 1990). 
* Truth (Cambridge: Blackwell, 1990). 
* Makmg l Expkat (Cambndge. Harvard, 1994). 
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Compare any of these accounts of truth with Frank Ramsey’s® re- 
dundancy theory, according to which ‘It is true that Caesar was mur- 
dered’ means no more than that Caesar was murdered (thid., pp. 
34-35). (Any difference is entirely “stylistic”: for example, we may 
use ‘It is true that...’ to speak more emphatically.) In effect, the re- 
dundancy theory treats ‘It is true that...’ as a sentential connective 
with the same truth table as double negation. As a result, the redun- 
dancy theory cannot handle cases where is ‘...is true’ seems to func- 
tion as a predicate attached to sentences designated by a singular 
term, as in ‘Fermat’s Last Theorem is true’. To cope with these 
cases, truth talk must be seen as involving some kind of denominaliz- 
ing operation. This is just what contemporary versions of deflation- 
ism let us do. 

The great advance of Quine’s view over the redundancy theory, 
then—an advance which its current rivals preserve —is that it shows 
how truth talk contributes significantly to the expressive resources 
of our language. For a disquotationalist, ‘true’ offers a way of re- 
placing talk about the world with logically equivalent talk about 
words. This move to the level of talk about words (“semantic as- 
cent”) gives us new things to generalize over—that is, linguistic ob- 
jects, sentences—thereby enabling us to express agreement and 
disagreement with sentences that we cannot specify: for example, 
because we do not know exactly what they are (“What the President 
said is true’) or because there are too many of them (‘Every sen- 
tence of the form “Por not P” is true’). 

We see from this that we must distinguish two ways in which defla- 
tionary approaches to truth aim to be “deflationary.” The first—and 
fundamental—has to do with the function or significance of truth 
talk. On this issue, contemporary deflationists are in broad agree- 
ment: the function of truth talk is wholly expressive, thus never ex- 
planatory. As a device for semantic assent, the truth predicate allows 
us to endorse or reject sentences (or propositions) that we cannot 
simply assert, adding significantly to the expressive resources of our 
language. Of course, proponents of traditional theories of truth do 
not deny any of this. What makes deflationary views deflationary is 
their insistence that the importance of truth talk is exhausted by its 
expressive function. 

The second aspect of deflationism concerns the mechanism of 
truth talk: the logico-linguistic means by which truth talk accom- 


* “Facts and Propositions,” Procaedings of the Anstotehan Sociaty, Supplementary 
Volume vu (1927): 153-70, reprinted in D H. Mellor, ed, FP Ramsey: Philosophical 
Papers (New York: Cambridge, 1990), pp. 34-51. 
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plishes its expressive function. Here deflationists diverge: Quine 
speaks of disquotation, Horwich treats truth as a predicate of propo- 
sitions, Brandom thinks of it as a sui generis prosentence-forming 
operator, and so on.’ Nevertheless, there remains a common inten- 
tion to explain the workings of truth talk in a metaphysically parsi- 
monious way. So from a theoretical standpoint, these competing 
suggestions are all deflationary. 

These two aspects of deflationism — functional and theoretical—are 
obviously connected. ‘Traditional theories of truth—correspondence, 
coherence, pragmatist—presuppose that truth is an explanatorily sig- 
nificant property. Just as, because acidity is an important explanatory 
property in our accounts of chemical reactions, we want also to explain 
what it is that makes some substances acidic, so a philosopher who sup- 
poses that truth has important explanatory work to do—in the philoso- 
phy of language or epistemology—will want to explain what makes 
true sentences (propositions, beliefs, and the like) true. By contrast, if 
we regard the function of truth talk as only expressive, there need not 
be any complicated story about the nature of truth. Given deflationism 
with respect to truth talk’s function, it makes sense to suppose that its 
mechanism can be explained using fairly limited theoretical resources. 

IIL IS THERE MORE TO TRUTH THAN CONVENTION-T? 
Davidson takes it to be common ground among deflationists that Al- 
fred Tarski® has told us pretty much everything we need to know 
about truth. In Davidson’s view, however, Tarski does not really de- 
fine truth because his method for defining “truth predicates” for par- 
ticular formal languages does not explain what those predicates have 
in common. Several ideas are in play here. 

As is well known, Tarski declines to define a general notion of 
truth because he thinks that such a notion invites paradox. Instead, 
he shows how to define, in a suitable metalanguage, a metalinguistic 
predicate ‘truein-L’ for an appropriate language L (the object lan- 
guage). Since he defines truth in terms of satisfaction, a generalized 
notion of reference, from a deflationary standpoint, Tarski might be 
regarded as telling us more than we need to know about truth. Still, 
his constructions are deflationary in spirit because he defines satis- 
faction (for L) by cases. And this is Davidson’s point: such a proce- 
dure offers no general understanding of anything since “a definition 
given in this way can provide no clue for the next or general case.” 
This limitation is inherited by the truth definitions themselves in 


’ There may be less to these differences than meets the eye. 
* “The Concept of Truth in Formalised es,” in Logic, Semanhcs, Mata- 
mathematics (New York: Oxford, 1956), pp. 152-278. 
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that “Nothing in Tarski’s truth definitions hints at what it is that 
these definitions have in common.” We must conclude, therefore, 
that “there is more to the concept of truth—something absolutely 
basic, in fact—~which Tarski’s definitions do not touch.”* 

None of this is an objection to Tarski and Davidson does not sup- 
pose that it is. Tarski’s truth predicates are defined for certain for- 
mal languages, where the interpretation of basic terms is a matter of 
stipulation. If we want to extend such a language by adding a new 
term, we can assign it whatever extension we choose. We do not 
need any “clues” as to how to go on. But to say this is simply to point 
out the gap between Tarski’s constructions and ordinary truth talk. 

Now, it is true that, for Tarski, when we extend a language by 
adding new terms, we introduce a wholly new satisfaction relation, 
and so a wholly new truth predicate. But it is misleading to say that 
Tarski does not even hint at what this new predicate has in common 
with the old. What makes all Tarski’s predicates truth predicates is 
that they meet the same standard of material adequacy. The stan- 
dard of adequacy in question is, of course, the famous convenkon-T: 


(T) Xis true if and only rf p. 


(where ‘X’ is a name for some sentence of the object language and 
‘p’ is a sentence of the metalanguage that correctly translates X). A 
definition of truth will be considered “materially adequate” only if it 
allows us to prove each member of the appropriate set of equiva- 
lences (if it “meets convention-I”). Of course, the formal truth def- 
initions do not themselves make this clear: that is, they do not 
themselves say why the predicates they define are truth predicates. 
Nor do they explain the point of having such metalinguistic predi- 
cates. But why is this a problem? 

As Tarski makes clear, and as Davidson is of course well aware, it is 
via convention-T that his formally defined predicates make connec- 
tion with our intuitive conception of truth. I think that this justifies 
Horwich’s claim that equivalences of the sort to which convention-T 
points, and not Tarski's constructions, are what really give the con 
tent of the intuitive notion of truth. We may therefore want to 
gay—particularly in the light of their restriction to artificial formal- 
ized languages and their enumerative approach to defining satisfac- 
tion relations—that these constructions add nothing to whatever 
understanding of truth is implicit in convention-T. It is only in virtue of 


* "The Structure and Content of Truth,” pp 287, 288 
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that prior understanding that we recognize Tarskistyle definitions as 
having anything to do with truth. But that is a comment on Tarski, 
not on deflationism generally. It does not show that the deflationist 
account of what constitutes that prior understanding is inadequate. 

The suspicion may still linger that convention-T does not really ex- 
plain what such definitions have in common. This suspicion is just+ 
fied. A criterion of adequacy will not itself explain the point of 
having metalinguistic predicates that satisfy it But as we have seen, 
deflationism involves an account of the function of truth talk as well 
as (rival) accounts of its mechanism. This is the point I once tried 
to make by responding to Davidson’s question “What do such predi- 
cates have in common?” with the remark: “the utility, for any lan- 
guage, of having its own disquotation device.” Davidson comments 
that it is difficult to assign a clear meaning to the “utility” of a device. 
But the utility in question is simply the addition to our expressive re- 
sources which comes with a device for semantic ascent.” 

So far, then, Davidson’s reflections on convention-T fail to pose 
significant problems for deflationary approaches to truth. He has a 
much more serious objection to make, however. A further reason 
for rejecting the idea that convention-T goes to the heart of our con- 
cept of truth is that: 


in those cases where the object language is contained ın the metalan- 
guage, the requirement is merely syntactical: ıt tells us something about 
the predicates, but not much about the concept. In other cases its ap- 
plication depends on our prior understanding of the notion of transla- 
tion, a concept far more obscure than that of truth.” 


Consider Quine. He explains the core use of ‘true’ by appeal to 
equivalences that involve sentences and their quotation-mark names. 
Following Hartry Field,” we can call the concept of truth thus de- 
fined purely disquotahonal Pure disquotationalism does characterize 
the mechanism of truth syntactically. Evidently, this austere notion 
of truth, as grasped by a given person, applies only to utterances that 
the person in question understands. Equally evidently, however, or- 
dinary truth talk ranges more widely than this: hence the standard 
objection to disquotationalism, of which Davidson’s point is a ver- 
sion, that we sometimes wonder about the truth of an utterance we 


” “The Structure and Content of Truth,” p. 296, note 33. 

u Ibid., p 296 

" “Disquotationalism and Factually Defective Discourse,” Philosophical Review, 
ci, 3 (July 1994): 405-52. See espeaally pp. 405-09. 
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do not understand. Theoretical austerity, it seems, compromises 
functional adequacy. 

This objection is not immediately decisive. As Field points out, a 
deflationist can say that, in the sort of case Davidson finds problem- 
atic, we are “wondering whether a good translation of the utterance 
will map it into a disquotationally true sentence we do understand” 
(ibid., p. 408). Nevertheless, the objection poses a significant ex- 
planatory problem for the disquotationalist. If functional adequacy 
is not to compromise theoretical parsimony, whatever extensions of 
truth talk such examples require must be explicable in terms of dis- 
quotational truth together with fairly limited additional resources. 

On this issue, I see no serious difference between Horwich and 
Quine. They may appear to diverge, since Horwich’s minimal the- 
ory invokes propositions. If he can unpack talk of propositions in a 
suitably parsimonious way, however, the difference will be more ap- 
parent than real. Once we admit the need for extended disquota- 
tionalism, we see that any deflationist will have to say something 
about meaning; and if he wishes to avoid compromising his theoret- 
ical deflationism, he will have to be careful what he says. 

Careful in what way? Again following Field, we can agree that the 
more radical and interesting forms of deflanonism will not count ob- 
jective interpersonal synonymy as belonging to these resources. 
Thus, a deflationist who appeals to a notion like “good translation” 
will add that “‘good translation’ should be taken to be a highly con- 
text-sensitive and interest-relative notion; it should not be under- 
stood as meaning ‘correct translation’, for talk of correctness of 
interpersonal translation makes no sense” (ibid). 

But Field may be understating the antideflationist challenge. With 
respect to extending disquotational truth in a way that remains theo- 
retically deflationary, the problem is not just one of eschewing ap- 
peal to objective synonymies. The real disaster would be if our 
notion of good translation had to be explained in terms of a nondis- 
quotational notion of truth. This would be a disaster no matter how 
context sensitive our notion of translation turned out to be. If 
meaning (or translation) depends on truth, and if an adequate func- 
tional deflationism requires extended disquotationalism, the attempt 
to be a functional deflationist will be self-defeating. This is just what 
Davidson thinks. Because he explains meaning in terms of truth 
conditions, he cannot be a deflationist—functionally or theoreti- 
cally—with respect to truth. 

In saying that he accepts truth as a primitive notion, Davidson 
means that the concept of truth is not something we should try to 
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define or, if we doubt the possibility of giving a strict definition, cap- 
ture by some “brief criterion, schema [or] partial but leading hint.” 
He does not mean that the concept cannot be illuminated. But illu- 
mination is to be found in tracing “the connections between the 
concept of truth and the human attitudes and acts that give it body” 
(thid.). Davidson has in mind believing, asserting, interpreting, un- 
derstanding, and so on. 

Deflationism, as Davidson understands it, takes the opposite view. 
It reflects just the urge he thinks we must resist: to characterize truth 
exhaustively in terms of some brief formula. Accordingly, deflation- 
ism is committed to the view that the concept of truth can be under- 
stood in isolation from such concepts as belief or meaning. But 
although “(t]he disquotational feature of truth, wedded to the 
thought that this may exhaust the content of the concept of truth, 
encourages the idea that truth and meaning can be kept quite sepa 
rate” (ibid. p. 271), this idea is mistaken. With such concepts as be- 
lief, truth, and meaning, none is to be defined. But all can be 
illuminated by tracing their relations with each other. 

Now, while Horwich does remark that truth has “a certain purity,”" 
I am not sure that it is correct to represent deflationists as holding 
that truth can be understood ful in complete isolation from consider- 
ations having to do with meaning, belief, and the like. As we have 
seen, disquotationalists accept the need for an extended notion of 
disquotational truth. All they are committed to is explaining the ex- 
tension in suitably deflationary terms: in particular, without covert ap- 
peal to some richer-than-disquotational truth concept. So the 
question is: Does Davidson's approach to meaning give us reason to 
think that they must fail in this attempt? I do not think that it does. 

IV THEORIES OF MEANING 
The key to understanding why Davidson's hostility to deflationism is 
misplaced is to distinguish two senses of the phrase ‘theory of mean- 
ing’. In a broad sense, a philosopher's theory of meaning is his ac- 
count of what meaning consists in. But although Davidson certainly 
advances a theory of meaning in this sense, he also uses the phrase 
‘theory of meaning’ more narrowly. In this narrow sense, a David- 
sonian theory of meaning is an axiomatic theory that generates the 
meaning of every sentence of some target language. Now, since the 
‘meaning’ thus generated is simply the sentence’s truth condition, it 
is natural to suppose that Davidson identifies meanings with truth 


» “The Folly of Trying to Define Truth,” p. 276. 
” Horwich, quoted by Davidson, “The Folly of Trying to Define Truth,” p. 274. 
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conditions, so that his broad sense theory of meaning is that mean- 
ings are truth conditions. If this were so, his conception of a (nar- 
row sense) theory of meaning would determine his general account 
of what meaning is, so there would not be much point in insisting on 
the distinction. But, of course, this cannot be the end of the story. 
Davidson owes (and provides) an account of why a device for gener- 
ating truth conditions should be thought to specify meanings, as well 
as an account of the adequacy criteria to which such a theory of 
meaning is subject. His theory of meaning (broad sense) involves 
more than a simple identification of meanings with truth conditions, 
if indeed it involves such an identification at all. 

Davidson’s splitievel approach to meaning reflects his acceptance 
of Quine’s skepticism about reified “meanings.” For Quine, since 
there are no meanings—in the head or anywhere else—meaning is 
just whatever translation preserves. We therefore achieve an exhaus- 
tive understanding of meaning if we determine (a) what needs to go 
into an adequate translation manual between our own language and 
some alien tongue and (b) the constraints that guide and govern the 
construction of such manuals. Quine thinks that the constraints on 
translation fall far short of enforcing a unique choice of manual. 
Translation—hence meaning—is indeterminate. 

Davidson prefers “interpretation” to “translation”; and his 
methodological views diverge in some ways from those of Quine. 
But strategically his approach to meaning is the same. If we (a) deter- 
mine the theoretical form of the knowledge at which interpretation 
aims and (b) fix the methodology of interpretation, then we have 
said all there is to say about meaning. For Davidson, meaning just ts 
whatever the practice of interpretation reveals. 

We can now see more clearly why, when thinking about Davidson, 
we must be careful not to conflate the two uses of the term ‘theory of 
meaning’. Using the term narrowly, as Davidson himself often does, 
we refer to some axiomatic theory, a recursive device for specifying 
the meaning of every sentence of a given language or, more pre- 
cisely, of the current idiolect of a particular speaker. No such partic- 
ular theory constitutes an account of what meaning consists in, 
however, any more than Tarski’s constructions tell us why the met- 
alinguistic predicates they define are truth predicates. To determine 
Davidson's views about the nature of meaning, we must attend to his 
ideas about the constraints that particular theories of meaning must 
satisfy to be judged acceptable: his theory of interpretation. For clar- 
ity, I shall henceforth use the phrase theory of meaning exclusively in 
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the narrow sense. | shall refer to Davidson’s more general, interpre- 
tationist conception of meaning as his theory of interpretation. 

Making this distinction dispels the illusion that Davidson “explains 
meaning in terms of truth conditions.” Certainly, he uses truth talk 
in (narrow sense) theories of meaning. But the use is expressive: 
truth talk is used in the course of specifying particular meanings. 
Davidson would explain meaning in terms of truth only if he needed 
to appeal to some richer-than-disquotational notion of truth in his 
theory of interpretation, the level at which he explains what meaning 
is. But at this level, too, truth talk turns out to be no more than the 


expressive convenience that functional deflationism takes it to be. 
V. TRUTH AND MEANING 


For Davidson, a theory of meaning (narrow sense) takes the form of 
a Tarski-style truth definition. The instances of convention-T, the T- 
sentences, which emerge as theorems give the meaning of each indi- 
vidual sentence of the target language by stating that sentence’s 
truth condition. In “Truth and Meaning,” Davidson” advocates such 
a structure as the proper form for a theory of meaning because it 
avoids appeal to a nonextensional notion, such as ‘means that’. In- 
stead of requiring that a theory of meaning issue in theorems of the 
form ‘s means that P’ for every sentence S of the target language, he 
suggests treating the position occupied by ‘p’ extensionally, replacing 
‘means that’ by ‘if and only if’ and supplying the sentence-identifying 
description ‘s’ with some predicate proprietary to the theory of 
meaning. This yields theorems of the form: 


(T) Sis Tif and only if p. 


All we need now require of a theory of meaning for a language L is 
that, without appeal to any (further) semantical notions, it place 
enough restrictions on the predicate ‘is T’ to entail all sentences got 
from schema T when ‘S’ is replaced by a structural description of a 
sentence of L and ‘p’ by that sentence. Since the condition placed 
on satisfactory theories of meaning is essentially Tarski’s convention- 
T, “it is clear that the sentences to which the predicate applies will be 
just the true sentences of L” (ibid., p. 23). 

This argument seems to imply that a Tarskistyle truth definition 
could function as both a (partial) explanation of truth and an expla- 
nation of meaning (for a given language). In “Truth and Meaning,” 
Davidson embraces this implication. Thus: 


4 In Arguenes mio Truth and Interpretation (New York: Oxford, 1984), pp. 17-36. 
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There is no need to suppress...the obvious connection between a defini 
tion of the kind Tarski has shown how to construct, and the concept of 
meaning. It is this: the definition works by giving necessary and suffi- 
cient conditions for the truth of every sentence, and to give truth cond+ 
tions is a way of giving the meaning of a sentence. To know the 
semantic conception of truth for a language is to know what it is for a 
sentence—any sentence—to be true, and this amounts, in one good 
sense we can give to the phrase, to understanding the language." 


In later writings, however, Davidson argues that his position in 
“Truth and Meaning” embodies a confusion. His mistake, he says: 


...was to think that we could both take a Tarski truth definition as 
telling us all we need to know about truth and use the definition to de- 
scribe an actual language.” 


We cannot both treat a Tarski-style truth definition as an empirical 
theory of meaning for a natural language and as telling us about truth, 
because we cannot assume an ability to recognize what T-sentences to 
count as correct. This was not a problem for Tarski: he simply as- 
sumed that, in a T-sentence, the sentence to the right of the bicondt 
tional was a correct translation of the sentence whose truth conditions 
it gives. But for Davidson, the main problem is to explain the empirr 
cal basis for a Tarski-like theory's being taken to give correct truth 
conditions for the sentences of some target language. That basis must 
consist, at least in part, in our ability to recognize that certain alien ut- 
terances are true, antecedently to our attaining a detailed understand- 
ing of what they mean. But this means that, in constructing a theory 
of meaning, we rely on a prior understanding of truth. 

Davidson takes this argument to show that we face a fundamental 
choice: either we use the notion of truth to get at meaning or we ex- 
plain truth in ways that rely on the notion of meaning. He takes the 
first option. He writes: 


.. While Tarski intended to analyse the concept of truth by appealing (in 
convention-T) to the concept of meaning (in the guise of sameness of 
meaning or translation), I have the reverse in mind. I considered truth 
to be the central primitive concept, and hoped, by detailing truth’s 
structure, to get at meaning.” 


Davidson sees himself as turning Tarski’s theory on its head. In- 
stances of convention-T are no longer “partial definitions” of truth, 
which is how many deflationists see them: they are empirical hy- 


* “Truth and Meaning,” p. 24. 
"The Structure and Content of Truth,” p. 286, note 20. 
* “Introduction,” Inquwertes mio Truth and Interpretation, pp. mirxx, here p. xv. 
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potheses about the truth conditions for sentences of a given speak- 
er’s language, hypotheses which require a prior understanding of 
the notion of truth. A theory of meaning illuminates meaning by re- 
vealing, in a systematic way, how the truth condition of a sentence is 
a function of the semantic properties of its parts: its logical structure 
and the reference of its terms. 

Horwich accepts Davidson’s appraisal of the situation. Thus, if: 


..understanding a sentence...is a matter of appreciating what must be 
the case for the sentence to be true—knowing its truth conditon—...it is 
not possible that this knowledge also helps to constitute our grasp of ‘is 
true’. For in that case we would be faced with something like a single 
equation and two unknowns.... [K]nowledge of the truth condition of a 
sentence cannot simultaneously constitute both our knowledge of its 
meaning and our grasp of truth for the sentence.” 


Horwich thinks that Davidson is right in the options that he sees and 
wrong only in the choice that he makes. We should explain mean- 
ing some other way—in terms of proper use. Then we can give a de- 
flationary account of truth. 

At first sight, this line of argument seems compelling. I think that 
Davidson's early “Truth and Meaning” position is much more defensi- 
ble, however, than Davidson, in his later writings, allows. Are we really 
forced to choose between taking truth for granted so as to explain 
meaning or meaning for granted so as to explain truth? We must pro- 
ceed carefully here. Even if Davidsonian theories of meaning require 
some prior grasp of truth, it does not follow that the function of truth 
in such theories is to be explanatory of meaning. Accordingly, it does 
not follow that such theories could not tell us something about mean- 
ing and something (though not everything) about truth. 

A Davidsonian theory of meaning, for a particular speaker, is a de- 
vice for giving the meaning of every sentence in that speaker's idiolect. 
If we like, we can say that such a theory explains what that speaker's 
sentences mean. But to explain what particular sentences mean is 
not to explain what meaning is. A fortiori, it is not to explain what 
meaning is by appeal to truth. In this respect, Davidsonian theories 
of meaning are just like Quinean translation manuals. Translation 
manuals give translations; they do not explain what translation is or 
what makes one sentence a good translation of another. The illu- 
sion that Davidsonian theories of meaning are explanatory is fos- 
tered by loose talk about “explaining meaning in terms of truth 


* Truth, p. 71. 
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conditions” or “identifying a sentence’s meaning with its truth con- 
dition.” For Davidson, there are no such things as meanings, hence 
no question of identifying them with anything. Davidsonian mean- 
ing theories are more accurately described as specifing meanings or, 
better still, as interpreting the sentences of some target language. 

In specifying the meanings of sentences in another speaker's 
idiolect, we associate his sentences with sentences of our own. In do- 
ing so, we make use of the truth predicate, which is what lends color 
to the idea that Davidson explains meaning in terms of truth condi 
tions. But the use of ‘true’ in a Davidsonian meaning specification 
for a particular speaker is expressive, not explanatory. He needs 
‘true’ only because he eschews ‘means that’ in favor of the material 
biconditional. Making use of ‘means that’, we could specify mean- 
ings using such sentences as: 


“Le Président de France est fou” means that the President of France is 
mad’. 


But: 


“Le Président de France est fou” if and only if the President of France 
is mad’. 
is not even grammatically well formed. To replace ‘means that’ with 
a sentential connective, we need a sentence on the left side. An 
ideal candidate would be a prosentence referring anaphorically to 
the target-language sentence whose meaning we want to specify. 
This is precisely what ‘true’ allows us to form. The appeal to truth in 
a Davidsonian theory of meaning is not explanatory. 
VL REFERENCE RELATIONS AS THEORETICAL ENTITIES 

For Quine, correct translations are all and only those which satisfy 
the methodological constraints enshrined in the theory of transla- 
tion: there is no further fact of the matter, in respect of meaning, for 
translation manuals to be right or wrong about. With respect to 
meaning, Quine is a thoroughgoing instrumentalist. But so is David- 
son. Substituting ‘interpretation’ for ‘translation’, what we just said 
about Quine goes for Davidson, too. Davidson is not a verificationist 
in the theory of meaning (narrow sense). That is, individual theo- 
ries of meaning give meanings by way of truth conditions (in the in- 
nocuous sense indicated in the previous section): they make no 
menton of verification or assertibility conditions. But he is surely 
some kind of a verificationist about the subject matter of such theo- 
ries. This follows from his Quinean antirealism with respect to 
meanings. 
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This feature of Davidson’s thought about meaning is obscured by 
an analogy to which he appeals to clarify the way in which truth is his 
real explanatory primitive. This analogy also contributes powerfully 
to the illusion that Davidsonian theories of meaning are themselves 
explanatory. The analogy in question is between reference relations 
and the unobservables postulated by theoretical science. 

Davidson invokes this analogy in the course of responding to the 
suggestion that, since Tarski shows how to define truth in terms of 
reference, we could take the notion of reference as primitive. David- 
son rejects this thought on the grounds that the basic evidence for 
theories of meaning is given by sentences held true in observable cir- 
cumstances. We therefore decide what the basic terms of another 
person's language refer to by determining what assignment of refer- 
ents make his sentence held true come out to be actually true. This 
means that in constructing a theory of meaning, even though it takes 
the form of a Tarskian theory of truth, we make use of a notion of 
truth that is not defined by the theory. The crucial point here, as 
Davidson points out, is not “to confuse the order of explanation that 
is appropriate once the theory is in place with the explanation of 
why the theory is correct.” Thus: 


The theory is correct because it yields correct T-sentences; its correctness 
is tested against our grasp of the concept of truth as applied to sentences. 
Since T-sentences say nothing about reference, satisfaction, or expres- 
sions that are not sentences, the test of the correctness of the theory is in- 
dependent of intuitions concerning these concepts. Once we have the 
theory, though, we can explain the truth of sentences on the bass of 
their structure and the semantic properties of the parts. The analogy 
with theories in science is complete: in order to organize and explain 
what we directly observe, we posit unobserved or indirectly observed ob- 
jects and forces; the theory is tested by what is directly observed.” 


This makes brilliantly clear how, for Davidson, reference in one way 
determines truth while at the same time, in another way, it is deter- 
mined by it. A Davidsonian theory of meaning, while formally com- 
positional, is methodologically holistic. As he says, 


If sentences depend for their meaning on their structure, and we un- 
derstand the meaning of each item in the structure only as an abstrac- 
tion from the totality of sentences in which it features, then we can give 
the meaning of any sentence (or word) only by giving the meaning of 
every sentence (and word) in the language.” 


” “The Structure and Content of Truth,” p $00. 
u “Truth and Meaning,” p. 22. 
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But because Davidson treats the reference of a term as “an abstrac- 
tion from the totality of sentences in which it features,” reference 
has no life of its own, independent of the truth of sentences. The 
analogy with the theoretical entities postulated by physicists is thus 
misleading. The analogy implies that the dependence of reference 
on linguistic behavior as a whole is merely epistemological, and that 
is not Davidson's view. Reference relations are not independently 
existing objects that explain observable features of a person's linguis- 
tic behavior. Rather, they are constituted by their role in making a 
person’s utterances, or most of them, come out true. They have no 
further properties for us to speculate about. In this respect, they are 


quite unlike electrons. 
VI. TRUTH AND INTERPRETATION 


We have seen that Davidsonian theories of meaning (narrow sense) 
are specifications of meaning. As such, they make essential use of 
the expressive power of the truth predicate. But they do not explain 
what meaning is: that is the task of the theory of interpretation, 
which articulates the constraints that determine when such meaning 
specifications can be regarded as correct. The question, therefore, 
is: Must Davidson take truth as an explanatory primitive in the the- 
ory of interpretation? 

Davidson says that, if we ask “What is missing, as an account of 
truth, in a theory of truth in Tarski’s style?” the answer is “the con- 
nection with users of language.” This is just the question of how we 
decide “whether a theory of truth is true of a given language.” But 


since all of us do understand some speakers of some languages, all of us 
must have adequate evidence for attributing truth conditions to the ut- 
terances of some speakers; all of us have, therefore, a competent grasp 
of the concept of truth as applied to the speech behavior of others.” 


But what does this grasp consist in? If we know how to interpret, we 
can extend our purely disquotational notion of truth to alien sen- 
tences, which would appear to give us just what Davidson demands. 
So the question remains: Why not give truth to the deflationists and 
let interpretation stand on its own feet? The only objection to so do- 
ing would have to be that the canons of interpretation themselves 
make use of a rich notion of truth. I do not believe that they do. 
Davidson’s views about interpretation can be seen as conse- 
quences of his holism. Thus, the first constraint on interpretation is 


= “The Structure and Content of Truth,” p. 301. 
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that we must find semantic structure—logical form-—in the sen- 
tences of our target language. This is, of course, an immediate con- 
sequence of holism, which treats the meaning of individual words as 
an abstraction from their contribution to the meanings of all the 
sentences in a person’s language. But to see words as making a sys- 
tematic contribution, we must be able to identify the logical form of 
the sentences in which they occur, and to see sentences as standing 
in logical relations to one another. According to Davidson, this is 
how interpretation differs from translation. 

Davidson refers to finding logical form in someone’s utterances as 
locating “the kind of pattern truth must make” and he thinks that 
Tarski’s contribution is to have shown just what that pattern must be. 
But what Tarski showed is how to define truth in terms of reference 
for languages formalized in first-order logic. So in practice, finding 
the pattern that truth must make means using standard first-order 
logic as the basis for a theory of the logical forms of sentences of nat- 
ural languages. This will involve showing how seemingly recalcitrant 
sentences—such as those involving adverbial modification—can be 
accommodated within this general structure. A commitment to find- 
ing this particular kind of logical structure in the sentences of natural 
languages has, however, no essential connection with a nondeflationist 
view of truth. We could say that adopting a particular logic amounts to 
adopting a particular—and so richer-than-disquotational—concept of 
truth. But we do not have to; and nothing in Davidson’s view of mean- 
ing, which requires only that we find in natural languages the struc- 
ture of first-order logic, mandates this further step. Davidson's theory 
of interpretation privileges first-order logic. It does not, however, re- 
quire that we build our logical preferences into our concept of truth. 

The second constraint is charity: we must take the beliefs of any- 
one we are trying to interpret to be, on the whole, true. What no- 
tion of truth is involved here? 

In his early writings, Davidson sees two dimensions to charity. One 
dimension—consistency—follows from the need to find logical struc- 
ture. Logical relations between sentences are our clues to logical form. 
But such clues would be useless in the face of massive illogicality. More 
deeply, in the face of such illogicality, there would be no pattern of log- 
ical interrelationships, hence no meaning, nothing to discover. But 
this imports no richer notion of truth than is already involved in the 
need to read a logic into the language of our interlocutor. 

What about the meaning of nonlogical terms? Given holism, the 
meaning of a person’s words has to be abstracted from sentences he 
holds true, that is, from his beliefs. But we cannot know what his be- 


MEANING AND DEFLATIONARY TRUTH 561 


liefs are unless we can understand his words. Meaning and belief are 
thus interdependent, another equation in two unknowns. The way to 
solve the problem of the interdependence of belief and meaning is by: 


.. holding belief constant while solving for meaning. This is accom- 
plished by assigning truth conditions to alien sentences that make na 
tive speakers right when plausibly possible, according, of course, to our 
own view of what is right What justifies this procedure is the fact that 
agreement and disagreement alike are intelligible only against a back- 
ground of massive agreement.” 


This is the second dimension of charity: we have to use our own be- 
liefs as the basis for interpreting the beliefs of others. Clearly, how- 
ever, the key notion here is agreement, not truth. Indeed, Davidson 
himself often explains charity in just these terms, as when he says 
that the crucial point is that “the correct interpretation of an agent 
by another cannot intelligibly admit certain kinds and degrees of dif- 
ference between interpreter and interpreter with respect to belief.”™ 
This says everything that needs to be said, without truth getting so 
much as a mention. 

To be sure, Davidson does also formulate his key methodological 
maxim in ways that make use of the notion of truth. Thus: “Making 
sense of the utterances...of others...requires us to find a great deal of 
reason and truth in them.”” It is clear, however, that the concept of 
truth, as it occurs here, bears no explanatory load. Charity requires 
us to attribute to the alien the maximal possible number of beliefs 
that we ourselves hold true. So our maxim is: for the most part, do 
not attribute to the alien the belief that p unless p, the belief that q 
unless q, and so on. This maxim employs ‘true’ as the generalizing 
device that deflationists say it is. 

Davidson thinks that deflationists sever all links between meaning 
and truth. But to the extent that such a link is enforced by charity, 
this is simply not so. Nothing prevents a deflationist about truth from 
holding that interpretation requires attributing to other people com- 
mitment to lots of truths. And this is all charity involves. To find a 
great deal of truth in the utterances of another is Just to find among 
those utterances lots of truths. Since, inevitably, these will be truths 
“by our own lights” or “according to our own standards,” being chari- 
table involves only what Field calls “mapping lots of alien sentences 


* “Radical Interpretation,” Ingqumes mto Truth and Interpretation, pp. 125-39, here 
p. 137 

= “The Structure and Content of Truth,” p. 319. 

™ “Belef and the Basis of Meaning,” in Inqutnes mto Truth and Interpretation, pp. 
141-54, here p. 158 
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onto disquotationally true sentences that we ourselves understand” 
(op. at, p. 408). Charity can rejoice in truth without being puffed up. 

There is one final aspect of Davidson’s views to consider. To fix 
the interpretation of nonlogical terms, charity as consistency-plus- 
truth-maximization is not enough: we need a constraint on interpre- 
tation that connects language with the world. Here, Davidson 
follows Quine. The fundamental evidence to which interpretation is 
responsive is given by occasion sentences that are responsive to read- 
ily detectable events in our surroundings: the passing by of a rabbit 
and so on. As Davidson himself remarks, the distinction between 
“sentences whose causes to assent come and go and those a speaker 
clings to through change” is all that “offers the possibility of inter- 
preting the words and sentences beyond the logical.”* So we have a 
third dimension of charity: attributing true beliefs in certain 
methodologically basic cases. 

Is there a rich notion of truth here? Again, no. As before, the rel- 
evant interpretive maxim involves a generalizing rather than an ex- 
planatory use of ‘true’. With respect to observation sentences, the 
rule is: interpret ‘Gavagai’ as ‘Lo a rabbit’ if there is a rabbit passing 
by; or in general, make the interlocutor’s observation sentences 
come out true. Notice, incidentally, that, in the case of observation 
sentences, nothing distinguishes Davidson's theory of interpretation 
from an approach to meaning in terms of assertibility conditions. In 
the case of observation sentences, assertibility conditions include 
truth conditions; and to state such assertbility conditions, we only 
need ‘true’ as a generalizing device. 

Let us return to Horwich’s suggestion that deflationists, since they 
cannot explain meaning in terms of truth, should explain it in terms 
of use. IfI am right that the role of truth in Davidsonian theories of 
meaning is expressive rather than explanatory, and if it turns out 
that the notion of truth plays no essential explanatory role in the 
theory of interpretation either, Davidson may be closer to Horwich 
than either thinks. Or I should say “used to think,” for in his latest 
writings Horwich” has come round to the view that “Davidson’s Prin- 
ciple of Charity arguably boils down to the use theory of 
meaning” (ibid., p. 72). I think Horwich is right: it does. 

The theorems of a Davidsonian theory of meaning are material 
biconditionals. Since such biconditionals will be true provided that 
they pair true sentences, it is natural to wonder how they can be re- 


= “A Coherence Theory of Truth and Knowledge,” in Lepore, ed., Truth and 


Interpretahon, pp 307-19. 
7 Meantng (New York: Oxford, 1998), p 72. 
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garded even as specifying meanings. Given a material reading of 
the biconditional: 


(S) “Snow 1s white” is true if and only if grass is green’ 


is true, but ‘Snow is white’ does not mean ‘Grass is green’. But this is 
why Davidson embraces holism. (S) is not absurd in itself. It is just 
that we would be entitled to come up with (S), in an attempt to un- 
derstand a fellow speaker, only if we could construe lots of that 
speaker's other utterances, subject to the multiple constraints of 
charity: in particular, those relating to his observational uses of 
‘snow’, ‘white’, and ‘green’. Sentences giving “truth conditions” can 
be regarded as specifying meanings only in the context of a scheme 
for interpreting an entire language. Seen in this light, Davidson’s 
view of meaning does not stand in a simple relation of opposition to 
what Horwich conceives of as a use theory. As Horwich puts the 
point: “Implicit in...the Principle of Charity...are the following ideas: 
(1) that what a person holds true is determined by various factors, in- 
cluding the observable circumstances, background beliefs, inferen- 
tial propensities, and basic regularities of use of words—regularities 
that specify, as a function of the other factors, which sentences are 
held true; and (2) that a good assignment of truth conditions should 
optimize agreement in the sense of preserving these basic regulari 
ties."™ And since, by Davidson’s lights, the reference of a speaker's 
terms is no more than an abstraction from the totality of the sen- 
tences he holds true, reference itself is a matter of contribution to or 
function within a belief system, as manifested in patterns of assent 
and dissent, special attention being paid to occasion sentences. In a 


word, reference is a matter of use. 
VIL CONCLUSION 


What do the T-sentences that are the theorems of a theory of mean- 
ing for a particular language or speaker tell us about: truth or mean- 
ing? To this question the final answer is “both.” 

They tell us something about truth: they fix the extension of ‘true’ 
as applied to alien sentences, taking us from pure to extended dis- 
quotational truth. They do not, however, “explain what truth is.” 
They presuppose a grasp of the function and mechanism of truth in 


” Ibid. Despite this take on chanty, Horwich continues to think of the problem 
of ruling out aberrant T-sentences of the “Snow 1s white” us true if and only if grass 
is green’ variety as a possibly insuperable problem for Davidson Perhaps he reads 
too much into the claim that, for Davdson, an utterance’s truth condition “const 
tutes its meaning”—see, for example, tid., p. 168, footnote 11. 
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our home language, exploiting without explaining the expressive 
function that truth talk serves. 

Davidson's account of when such theories are to be counted cor- 
rect makes no appeal to notions like “objective interpersonal syn- 
onymy.” Our “meaning theory” for a given speaker undergoes 
constant modification and so is really a series of “passing theories.”* 
Davidson therefore respects Field’s constraint that a deflationist 
should treat “‘good translation’ as a highly context-sensitive and in- 
terest-relative notion.”** For Davidson, ‘correct’ here means accept- 
able. It does not and should not mean ‘uniquely acceptable’. 
Davidsonian theories of meaning offer a way of moving from pure to 
extended disquotational truth which meets Field’s standards for be- 
ing (theoretically) deflationary. 

The theorems of a Davidsonian theory of meaning also tell us 
something about meaning: they specify the meanings of the sen- 
tences of the target language by pairing them systematically with sen- 
tences antecedently understood. But they do not explain what 
meaning is: that is accomplished by the theory of interpretation, 
which explains what makes such a systematic pairing acceptable. 

There is no circularity here: meaning explaining truth and truth 
explaining meaning. Rather, Davidsonian theories of meaning de- 
rive their power to tell us something about meaning and something 
about truth from the theory of interpretation, in which neither 
meaning nor truth figures as a primitive notion. Davidson thinks 
that he explains meaning in terms of truth. But what he really does 
is illuminate both meaning and truth by appeal to interpretation, an 
activity that can itself be understood without extensive theoretical re- 
sources. This is why he can afford to be a deflationist. For someone 
like me, attracted to both a Davidsonian approach to meaning and a 
deflationary approach to truth, this 1s a welcome result. 

MICHAEL WILLIAMS 
Northwestern University 


™ See Davidson, “A Nice Derangement of Epitaphs,” in LePore, ed., Truth and 
Interpretaton, pp. 125-39. 
* Field, p. 408. 
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Imost all epistemologists are nonskeptics: they accept that 
people have knowledge. Almost all epistemologists say, too, 
that most—maybe all—human knowledge is fallible. But 
what is it for an instance of knowing to be fallible? The usual answer 
will be something like this 
Falk: one’s knowledge that p 1s fallible, if and only if (1) one knows 
that p but (2) one’s justification for one’s belief that p 1s compatible 
with one’s being mistaken ın believing that p.’ 


Yet this is not a wholly satisfactory characterization. No belief ina 
necessary truth can be mistaken—because no necessary truth can be 
false. So Falk entails that no such belief could be fallible knowledge; 
and that seems like a mistaken result itself. One’s justification for a 
belief in a necessary truth could be less than perfect, and this should 
make the belief fallible knowledge at best—even though this justifi- 
cation is incompatible with the belief being mistaken. Should we 
therefore broaden FalK, so that a piece of knowledge that p can be 
fallible because of something other than p’s possible falsity (in some 
pertinent sense of ‘possible’)? 

In my experience, epistemologists will be reluctant to do that. Fal 
libility (they will say) is definitionally tied to a possibility of being 
mistaken. The problem with that reluctance to seek a broader analy- 
sis of fallible knowledge, however, is that it can prevent our noticing 
a more general kind of epistemic failing—in fact, the kind of failing 
that underlies the traditional concept of knowing fallibly. Were it 
not for that reluctance, I would be happy to extend the term ‘fallible 
knowledge’ to this more general kind of failing. Given that reluc- 
tance, however, I shall call this more general phenomenon epistemic 
fatlahtlity—a way of knowing which generates what I call faslable 
knowledge. And it will be the subject of this paper.’ 


* I am grateful to Scott Campbell, Richard Feldman, Tim Oakley, Philip 
Staines, and Konrad Talmont-Kaminski for their helpful comments on an earlier 
version of this paper 

' See, for example, Stewart Cohen, “How To Be a Fallibilist,” in James E. 
Tomberlin, ed., Philosophical Perspectrves, Volume 2 (Atascadero, CA: Ridgeview, 
1988), pp. 91-123, here p 91. 

* Philosophers use the term ‘fallibilism’ in two ways-—~to denote fallible knowt 
edge, or to refer to human cognitive fallibility in general. See, for instance, Susan 
Haack and L. S. Carrier in, respectively, “Fallibilism and Necessity,” Synthese, XLI, 1 
(May 1979): 37-68, and “How to Define a Nonskeptical Fallibilism,” 

XXI, 3-4 (December 1993). 361-72. I shall eschew the term ‘fallibiism' and its cog- 
nates, as used in the second of these two ways Instead, I shall be generalizing the 


first of those two ways 
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I. FAILABLE KNOWING IN GENERAL 
(A) The underlying motivation for the usual epistemological analysis 
of what it is to know fallibly is clear. The motivation is the fact that 
possible falsity is a possible lack of knowledge. Because knowledge is 
of only what is true, a given p's being false entails that no one knows 
that p. So, any belief that p (for that given p) could have fatled to be 
knowledge—even when, in fact, it is knowledge. 

With that observation, though, we are already on our way to a 
broader, hence more widely applicable, conception of knowing-in-a- 
lessthan-ideal-way than is provided by the usual conception of falli- 
ble knowing. Instead of restricting ourselves to an analysis of fallible 
knowing, we should focus on this more general idea: 


FailK1: one’s knowledge that p 1s failable, if and only if (1) one knows 
that p but (2) one might have failed to do so. 


Knowing fatlably entails possibly not knowing. And knowing faltbh 
entails knowing failably: if one’s belief that p is knowledge yet it could 
be mistaken, then it is knowledge which could fail to be knowledge, 
and hence it is knowledge which might not have been knowledge. I 
shall accept FailK] as a minimal—formal, generic, schematic—condit 
tion to be satisfied by any more specific account of failable knowing. 

(B) In order to find a more specific instantiation of FailK1, we 
need only describe the various routes to a possible lack of knowledge 
(the possible lack referred to by FailK1(2)). Possible falsity is one of 
them, but there are also other ways possibly to lack a piece of knowl 
edge. Let us assume that knowing that p involves having at least a 
welljustified true belief that p. That being so, one lacks knowledge 
that p if p is false, or if one does not believe that p, or if one lacks 
good justification for the belief that p. Consequently, there are at 
least the following three examples of ways in which one might not 
have known that $. 

(i) It could be the case that one’s evidence for pis good, but not so 
good as to be incompatible with p's being false. In that event, p might 
not have been true, even if it is true and even if the evidence for its 
truth is itself true. This is a way for knowing to be failable due to one’s 
relying on nondeductive justification—that is, due to its being fallible. 

(ii) It could be the case that one believes that p, but not so conft 
dently or firmly that one might not have decided not to have that be- 
lief, even though the belief is true and even if in fact the belief is 
produced in response to one’s good evidence for its truth. This is a 
way for knowing to be failable due to one’s belief being present in 
too fragile a way, let us say. 
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(iii) It could be the case that even though one has good evidence 
for one’s true belief that p and even though one does base one’s belief 
that p upon that evidence, one might have had that true belief anyway, 
even without that evidence for it. This is a way for knowing to be fail 
able due to one’s belief being present in too dogmatic a way, let us say. 

Each of (1)-(iii) describes a minimal possible way to fail to know 
that p. Each is minimal, because each describes one’s lacking Just 
one of the three features described so far as being required for 
knowing. In (i), truth is absent (p is false), while the belief that p 
and good justification for that belief are present. In (ii), the belief 
that pis absent, while pis true and one has good justification for that 
belief. And in (iii), one lacks good justification for the belief that p, 
while having a true belief that p. 

Of course, one’s Knowing on a given occasion would also be ab- 
sent if one lacked even more than just one of those three features 
(and of any other features that knowledge requires, if there are any); 
subsection (D) will discuss that kind of situation. But our first need 
here is for a core conception of knowing failably. That is why I have 
outlined some minimal possible ways to fail to know that p. What 
will prove in section N to be a useful heuristic with which to sum up 
the core analysis | am about to present, then, is this: to know failably 
that pis to know that p even while coming at least a little close to not 
knowing that p. More formally: 


FailK2: a person x knows failably that p, if and only if (1) x knows that 
p, and (2) there is an accessible possible world where (i) p is false 
(but x believes that p, with the same good evidence for p as he has 
here), or (ii) x fails to believe that p (even though pis true and he has 
the same good evidence for p as he has here), or (iii) x fails to have 
the same good evidence for pas he bas here (but he still believes that 
f, and p is true)? (And an accessible possible world, as regards x's 


> Should clause (ui) be the following more sweeping condition instead? 

x fails to have good evidence for p (but he still believes that p, and is true) 
I do not believe that it should be I am discusmng whether a particular piece of 
knowledge that p is failable From FailK1, it is failable if it could be absent (while 
the person 1s still present); and a minimal way for it to be absent is for there to be 
an accessible posmble world where the person’s true belief that p is supported by 
different good evidence. For that is a world where at best the person has a differ- 
ent case of knowmg that p. From FailK1, to know mfailably on a given occasion us 
for that knowing—characterized by whatever features make it the instance of 
knowing it 1s—~to be infallable. And the features that make it the instance of know- 
ing it is include at least the particular belief it is, and that beliefs being true, along 
with the particular good evidence for it. Hence, if there is to be identity of know- 
ing across worlds, then at least the same belief and truth and good evidence are re- 
quired in each such world. (But if you reject transworld identity of knowing, you 
might replace clause (ii) with the suggested alternative. That would affect some 
subsequent parts of this paper in detail but not prinaple ) 
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knowing that p in this world, is a world which contains at least two 
members of this set {p's being true, x’s believing that p, x’s having 
the same good evidence for pas he has in this world}.)‘ 


FailK2 entails that for x to know infailably that p is for there to be 
not even a passththty of x lacking that knowledge. And FailK2 de- 
scribes three minimal ways to instantiate that possibility. I shall refer 
to worlds containing those instantiations as being epistemic-failure 
worlds for x in relation to p.* Failable knowledge is knowledge for 
which there is a possible epistemic-failure world. 

(C) Is my account of infailably knowing really an account of neces- 
sarily knowing? After all, the account begins with FailK1, which says 
that infailable knowing is knowing which is not such that it might 
not have existed. That is (it seems), one’s infailably knowing that p 


is one's necessarily knowing that p—where one’s necessarily know- 
ing that p is either (i) in every possible world, one’s knowing that p, 
or (ii) in every possible world where one exists, one’s knowing that p. 

In fact, FailK1 does not have that commitment. FailK] has led to 
FailK2—-an account which describes infailable knowledge’s infail- 
ability component in this way: 


In each world where one exists and where one has two of the elements 
of knowing that p, one also has the third element‘ 


Necessarily, to be that close to knowing that p ts to know that p; at 
least it is, when the knowing is infailable. That is what FailK2 


‘ Any epistemic-failure world for xin relation to p is thus an accessible world as 
Ten x's knowing that p (Hence, when I talk below of epistemic-faulure worlds, 
I not need to make iat that they are accessible worlds. And because any 
world that satisfies one of (1)-{ili) in FailK2(2) is an epistemic-failure world for xin 
relation to p, it is also an accessible world, as regards x's knowing that p. So, 
strictly, there was no need to say, as Fai[K2(2) does, that it is referring to accessible 
worlds.) But the converse implication fails, because if x knows infailably that p 
then any world containing at least two members of the set {p's being true, x's be- 
lieving that ~, x's having the same good evidence for pas he has here} also contains 
the third member of the set. That 1s, if x knows infailably that ġ, then none of the 
pertinent accessible worlds is an epistemuc-farAere world for xin relation to a 

* Implicit ın this, too, is a relatrvization to a particular time—a world being an 
epistemic-failure world for xin relation to p at time i. But, for simplicity and be- 
cause no unclarity will result, I ignore this extra relatvization in what follows. 

* “Does knowing not have more than three elements—truth, belief, good justifi- 
cation, and something else? Did Gettier not teach us that?” If section 0 below is 
correct, we do not have to agree that Gettler did that (And even If section N is 
wrong, the present discussion can be expanded to accommodate whatever else is 
involved in knowing.) 
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implies—clearly, a weaker commitment than is involved in neces- 
sarily knowing.’ 

The usual analysis—FalK—of fallible knowledge that p describes 
it as being knowledge that p which could be absent via the truth that 
p being absent, even when the belief that p and the good jusufication 
for that belief are present. In my terms: it describes fallible knowl 
edge that p as knowledge that p which is compatible with (i) being 
that close to knowing that p, yet (ii) still not knowing that p. I am 
generalizing that conception of fallible knowledge to obtain the con- 
cept of failable knowledge—so that failable knowledge that p is 
knowledge that p which could be absent via any one of the truth that 
p, the belief that p, and the good justification for that belief being 
absent, even when the other two are present. Can a person be that 
close, in any of those respects, to knowing that p—yet without know- 
ing that p? The distinctive feature of one’s infailably knowing that p 
is its being impossible for one not to know that p once one is so close 
to having the knowledge that p. 

(D) Although infallibility is absolute, fallibility is not. It can be 
more or less. We talk of a person’s being more, or his being less, fal- 
lible, at any rate: he is more, or he 1s less, likely to be mistaken, even 
given his evidence. The same way of talking is appropriate for epis- 
temic failability. A person can be more, or he can be less, epistem? 
cally failable: he is more, or he is less, likely to fail to know various 
facts. And can that qualitative or comparative dimension of epis- 
temic failability be applied to a particular instance of knowing? Can 
one person know that p more failably than another person does? Can 
one know that p more failably today than yesterday? 

That certainly can occur, at least in principle. If (as FailK2 tells us) 
x’s knowing that p being infailable involves there being no epistemic- 
failure worlds for xin relation to p, then x’s knowing that p being more, 
or its being less, failable involves (respectively) there being more, or 
there being fewer, and there being closer, or there being more distant, 
epistemic-failure worlds for x im relation to p. More fully: 

” Haack presents dogmatism -—the contradictory of what she calls fallibilism—as 
saying that “there are some propomtions which we can’t fail to beheve and which are 
such that it follows from our believing them that they are true” (op at, p 52). 
Adapting that to our present concerns gives us this: 

x's knowing infallibly that p includes (1) x’s believing that p in every world 

where x exists, and (ii) p's being true in every world where x believes that p 
That us, in every world where one exists, one has a true belief that p—+this being so, 
for any p which one infallibly knows. My account of infailable knowing 1s less de- 
manding than that account of infallibly knowing I require that, if one is mfailably 
to know that p, then each world where one exists and which includes at least two 
members of this set-—{ p's being true, one’s believing that p, one’s having good evi- 
dence for that belief}—-also includes the third member. 
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FailK3: a person x knows more, or x knows less, failably that p, if and 
only if (1) x knows that p, and, respectively, (2) there are more, or 
there are fewer, and there are closer, or there are more distant, posst 
ble worlds where (i) p is false (but x believes that p, with the same 
good evidence for p as he has here), or (ii) x fails to believe that p 
(even though pis true and he has the same good evidence for p as he 
has here), or (iii) x fails to have the same good evidence for p as he 
has here (but he still believes that p, and pis true). 


Exactly (or even approximately) how failable a particular piece of 
knowledge is will be a function of the pertinent epistemic-failure 
worlds, the worlds that satisfy clause (2) of FailK3. Correlatively, 
whether one piece of knowledge is more failable than another is also 
a function of those worlds. If x knows that p and knows that g, but 
there are more close epistemic-failure worlds for x in relation to p 
than in relation to q, then—-other things being equal—.x’s knowl 
edge that pis more failable than his knowledge that ¢.* 

For example, right now I know failably that I am typing. I also 
know failably that 13 x 14 = 182. As far as I can tell, though, the for 
mer piece of knowledge is more failable than the latter one. In 
rather rough outline, here is why that is so. 

(a) There are possible worlds where I am not typing even though I 
believe that I am and even though I have the same sensory evidence as 
I have here for my typing. (In those worlds, I am hallucinating, say.) 
There are worlds where I am typing and where I believe that I am, but 
where my evidence is not as itis here. (In those worlds, I am sensorily 
deprived, yet I remember intending to be typing now, say.) There are 
worlds where I am typing and where I have the same sensory evidence 
as here, but where I fail to believe that I am typing. (In those worlds, I 
distrust that evidence, say.) These are thus three kinds of epistemic- 
failure world for me in relation to the fact of my typing. 

(b) There are possible worlds where I have the same mathematical 
evidence as here for the fact that 13 x 14 = 182, but where I fail to 
believe that 13 x 14 = 182. (In those worlds, my caution about such 
calculations prevents me from having that belief, say.) There are 
worlds where I have the belief without that evidence. (In those 


* Could x also know that p more failably on one occasion than on another? If so, 
would his more falable knowledge that p be leas good, purely qua knowledge that 
p. than his less failable knowledge that p? The idea that propositional knowledge 
can admit of such qualitatrve gradations is rarely countenanced by epustemologists. 
I have accepted it elsewhere —*“The Sceptic Is Absolutely Mistaken (As Is 
Dretske),” Philosophscal Papers, xxvo, 1 (March 1998). 29-43. David Lewis has also 
endorsed that idea in “Elusive Knowledge,” Australanan Journal of Philosophy, LXXIV, 
4 (December 1996): 549-67, here pp. 562-63. 
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worlds, I remember being told that 13 x 14 = 182, say.) But there 
are no worlds where 13 x 14 # 182. 

All else being equal, then, there are more epistemic-failure worlds 
for me in relation to the fact of my typing than for me in relation to 
the fact that 13 x 14 = 182. I have distinguished between three 
kinds of epistemic-failure world. We could call them (in turn) truth 
failure worlds, belief-failure worlds, and jsusttfication-failure worlds. 
For me in relation to the fact of my typing, there are truth-failure 
worlds, belieffailure worlds, and justification-failure worlds. For me 
in relation to the fact that 13 x 14 = 182, there are only belief-failure 
worlds and justification-failure worlds—no truth-failure worlds. 
Therefore, all else being equal, my knowledge of my typing is more 
failable than my knowledge that 13 x 14 = 182.° 

Il FAILABLE KNOWING WITHIN GETTIER CASES 

(A) If failable knowing can be more or less failable, then in principle 
there can be instances of failable knowing which are very failable. 
These would be instances where the person is not just a little close to 
failing to know, instead, he is very close to lacking the knowledge. 
There are simple ways for that to occur (for example, I look up at 
just the right time to see, and hence know, who was running away 
from the robbery). But there are more surprising ways, too, in 
which knowing can be like that. The phenomenon of Gettier cases” 
supplies us with one such example. 

All epistemologists know (albeit failably) that such cases are regarded 
by all epistemologists as ones where there 1s a lack of knowledge." Re- 
gardless of that uniformity, I wish to outline an alternative way of mter- 
preting the cases, one according to which there is knowledge within the 
cases—failable knowledge, indeed very failable knowledge. 

Indisputably, each Gettier case contains some significant luck on 
the part of the case’s epistemic subject. His evidence has led him to 
a true belief that p; and the evidence is, in itself, good evidence for 
that belief. Nevertheless, each case also contains what we may calla 


* And is all else equal? It seems to me that I am more likely not to be typing, 
even when I think that I am doing so, than I am to suspend my simple mathematt 
cal beliefs when I engage in apparently normal calculations. In other words, the 
truth-failure worlds for me in relation to the fact of my typing are closer posible 
worlds than are the belieffailure worids for me in relation to the fact that 18 x 14 
= 182. If so, then on this score, too, my knowledge of my typing 1s more failable 
than my knowledge that 1$ x 14 = 182. 

4“ Such cases either are, or trace their ancestry to, those found in Edmund Gettier's 
modern classic, “Is Justified True Belief Knowledge?” Analpns, xxu, 6 (June 1968): 
121-23. I shall assume some familtanty with such cases on the part of the readers. 

U The conventional view of Gettier cases is that each contains an epistemic sub- 
ject who has a belief which is true and well sustified but not knowledge. 
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strange occurrence. And the presence of the strange occurrence is why 
the epistemic subject is somewhat lucky to have his well-justified true 
belief. That luck is manifested in one of the two following ways. 

(1) Helpful Gettier cases. Some Gettier cases contain a strange occur- 
rence which functions helpfully. If it had been absent,” then (all else 
remaining equal) the case’s epistemic subject would not have had his 
well-justified true belief. He would have lacked the truth or the be- 
lief or the good evidence. He would not have been even as close as 
he is to having knowledge. For instance, Roderick Chisholm’s” 
Sheep Case is like this. If there had not been a sheep in the field, 
hidden from his view, then (all else being equal) the epistemic sub- 
ject’s belief that there is a sheep in the field would not have been 
true. (It would have been supported by the same good but mislead- 
ing sensory evidence as at present, seemingly of a sheep, really of a 
disguised dog.) Edmund Gettier’s own cases are most likely helpful 
Gettier cases, too (op. ai.). In the first of those cases, for instance, if 
Smith had not been the person who will get the job, then (all else be- 
ing equal) it is probable that his belief that the person who will get 
the job has ten coins in his pocket would have been false. (Because 
all else is equal, Jones still does not get the job; hence, someone other 
than Smith and Jones does. And it is probable that whoever does get 
the job will not have ten coins in his pocket.) Those are just two ex- 
amples, but they are examples of a clear pattern: in each helpful Get- 
tier case, it is true that, if the case’s strange occurrence was absent, 
then-—all else being equal—the case’s epistemic subject would be in 
a close possible world where the truth that p or his belief that p or his 
good justification for p would be absent. In short: he would be in a 
pertinent and close epistemic-failure world for himself in relation to 
p. With the strange occurrence being present, though, the epistemic 
subject is not in an epistemic-failure world for himself in relation to p. 
(The strange occurrence plays a crucial part in bringing about his 
welljustified true belief.) 

(2) Dangerous Gettier cases. Each remaining Gettier case contains a 
strange occurrence which functions as a hidden danger, an unseen 
threat, to the epistemic subject’s having his well-justified true belief. 
If the strange occurrence had been absent, then (all else remaining 


" Sometimes the strange occurrence is conjunctive, involving a few circum- 
stances occurring. Always, though, at least part of the conjunction is vital to the 
case, in the way I am about to describe. 

d Theory of Knowledge (Englewood Cliffs, NJ: Prentice-Hall, 1989, third edition), 
p. 93. 
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equal) the epistemic subject would not have been in any real danger 
of not having his welljustified true belief. He would have had his 
welljustified true belief (as he does already) —but in a wholly nor- 
mal way (as he does not do already, within the Gettier case). He 
would have been as close as that can take him to having knowledge 
in a normal way. For instance, Carl Ginet’s Fake Barns Case" is like 
this. Henry is looking at a real barn, just as he would be doing if 
there were no fake barns nearby. Of course, there are fake barns 
nearby: that is the case’s strange occurrence. But now imagine that 
occurrence’s absence. If there were no fake barns nearby, then (all 
else being equal) there would be no imminent danger of Henry’s be- 
ing deceived by a fake barn into believing that he is seeing a barn. 
Because all else is equal, he would still have that belief; and it would 
still be true, since it is a response to a real barn in front of him. The 
belief would still be well justified, too, by his standard sensory evi- 
dence for it. Accordingly, with the absence of the fake barns and 
with all else being equal, Henry would have a belief which is true and 
well justified in a normal way. Nevertheless, in this case—as in each 
dangerous Gettier case—the case’s strange occurrence threatens to 
interfere: there are close possible worlds where it does interfere, 
these being worlds where the truth that p or the epistemic subject's 
belief that p or his good justification for p are absent. For example, 
there is a close possible world where Henry ts deceived by a fake 
barn. That is, there is a pertinent and close epistemic-failure world 
for Henry in relation to his seeing a barn.” 

In each Gettier case, therefore, the strange occurrence brings it 
about that the epistemic subject almost fails to have his welljustified 
true belief. In a helpful Gettier case, he has a welljustified true be- 
lief, in part due to the occurrence of the strange occurrence——but, 
by the same token (the strangeness of the strange occurrence), he al- 
most fails to have the well-justified true belief. In a dangerous Get- 


= The case comes to us via Alvin Goldman—“Discrimination and Perceptual 
Knowledge,” this JOURNAL, LXXM, 20 (November 18, 1976): 771-91 

» We should not infer from these examples that epustemicfailure worlds assoct 
ated with Gether cases are only ever futh-failure worlds Consider Gilbert Har- 
man's case of the political assassination in Thought (Princeton: Univernty Press, 
1978), pp 143-44. In hus story, Jill has good evidence for there having been an as 
sassination, and she happens not to hear what everyone else around her has 
heard—~a government denial of there having been any such event. If she had 
heard the denial, though, she might well have ceased to believe that there had 
been an assassination. She would at least have ceased to have the same good evr 
dence for believing that there had been an assassination. So, in this Gettier case 
the threat is not to the truth component of Jill’s putative knowledge. Instead, the 
threat is to the behef component and/or the justification component. 
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tier case, on the other hand, the epistemic subject has a well-justified 
true belief in spite of the strange occurrence. In a helpful case, it 
would have been so easy for the strange occurrence not to have been 
present—and hence (all else being equal) for the well-justified true 
belief to have been absent. In a dangerous Gettier case, it would have 
been so easy for the strange occurrence not to have allowed the well- 
justified true belief to be present. So, with each kind of Gettier case, 
there are close possible worlds where at least one member of the cru- 
cial triumvirate-—the truth that p, the epistemic subject’s belief that 
p, and his good justification for that belief—is absent. In other 
words, for each kind of Gettier case there are pertinent and close 
epistemic-failure worlds for the case’s epistemic subject in relation to 
p (the propositional object of his wellhustified true belief). 

Nevertheless, we need not infer from this that each Gettier case’s 
epistemic subject lacks the pertinent knowledge that p. Instead, sec- 
tion I above has provided a conceptual framework within which we 
may interpret those epistemic subjects as having knowledge —failable 
knowledge. FailK2 entails that, if the epistemic subject’s belief is 
knowledge, it is failable knowledge. It entails this, because we have 
seen just now that FailK2(2) is satisfied: for each Gettier case, there 
are associated epistemic-failure worlds (indeed, close ones for the 
case’s epistemic subject in relation to p). Hence, if FailK2(1) is also 
satisfied, the epistemic subject’s belief is failable knowledge. And is 
FailK2(1) also satisfied? 

The usual answer to that question would be this: “No. Within a 
Gettier case there is simply a lack of knowledge.” To which I reply: 


But the supposed lack of knowledge is due only to the strange occur- 
rence; and I have interpreted the strange occurrence as contributing 
just to FailK2(2)’s being satsfied—that is, to the failability as such, 
rather than to denying the knowing as such (the knowing referred to in 
FailK2(1)) The point 1s that we need not assume that the strange oc- 
currence’s presence denies the epistemic subject knowledge; instead, 
we may view it as affecting how the epistemic subject knows—such as the 
strength or quality or embeddedness of that knowledge, its failability 
rather than its presence. 


In particular, I advocate this interpretation: one’s having a welljustified 
true belief that p 1s sufficient for one’s knowing that p. But if there is a 
strange occurrence within one’s context, then the knowing is failable. 


In fact, we may strengthen that interpretation, by calling on FailK3 
as well. Because an epistemic subject’s being in a Getter situation 
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turns what would otherwise be some distinctive and distant epistemic- 
failure worlds into close ones, we may interpret such situations in the 
following terms: 


One’s having a wellyjustified true belief that p is sufficient for one’s 
knowing that p. But if there is a strange occurrence within one’s con- 
text, then the knowing is very failable 


This does justice to the feeling that there must be a difference in the 
quality of the instances of knowing in, respectively, a normal situa- 
tion where there is failable knowledge that p, and a Gettier situation 
where there is failable knowledge that p. In a normal situation, 
there is a well-justified true belief that p and there is no strange oc- 
currence. In a Gettier case, there is a welljustified true belief that p 
and there is a strange occurrence. That makes some of the epis 
temic-failure worlds for x in relation to p much closer than they 
would normally be for an epistemic subject, within x’s world, when 
knowing that p. Let us see why that is so (first in a general way; then 
by applying those general remarks to a particular example). 

The general explanation. (i) If x knows that p within a normal (non- 
Gettier) situation, then none of the close—the similar-—-accessible 
worlds, as regards that knowledge of x}, is an epistemic-fathie world 
for xin relation to p. That is, in every close world where at least two 
of the three traditional components of xs knowing that p are present, 
the third component is also present. Because (by our present hy- 
pothesis) x knows that p in a normal way, that normality is also pres- 
ent within any very close worlds where at least two of those traditional 
components are present, with the result—the normal result, after 
all—being that the third component is also present.’ (ii) On the 
other hand, if x knows that p within a Gettier situation, then there are 
some close accessible worlds, as regards that knowledge, which are 
epistemic-failure worlds for x in relation to p. This is because the 
close accessible worlds, as regards x’s knowing that p within a Gettier 
case, are not simply worlds where at least two of the three traditional 


"* Presumably, worlds where at least two of the three components are present but 
where skepical poembilites are actualized—with the third component therefore not 
also being present—-are rather distant ones. Worlds where a Cartesian evil genius 
falsifies all of one’s welljustified beliefs, say, or where one's well-supported inductive 
predictions are falsified by a sudden Humean alteranon to the world, are hke that 
(unless, of course, our world 1s more like an actualizaton of such a world than we 
take it to be). Note, too, that a skeptic might regard those worlds as including ones 
which contain various Gather situations For more on how a skepnc might claim to 
find skeptical potential in possible Getter cases, see my “Gettleristic Scepticism,” 
Australasian Journal of Philosophy, LXXIV, 1 (March 1996) 83-97 
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components of x’s knowing that p are present; they are also worlds 
that contain the strange occurrence from x’s Gettier situation. And 
in some of these worlds the strange occurrence’s presence ensures 
that the third traditional component of x’s knowing that p is not pres- 
ent. Relative to a specific case, there need not be worlds like this for 
each one of the three components, but there will be some for at least 
one of the three. For example, the luck with which, given the strange 
occurrence, x’s good evidence for p and his believing that p has coin- 
cided in this world’s Gettier situation with p’s being true, might be 
modeled by there being some close truth-failure worlds for x in rela- 
tion to p—some close truth-failure worlds for x in relation to $, that 
is, where the strange occurrence is also present. 

A particular case. We may now apply those general remarks to a 
particular case—Chisholm’s Sheep Case, say. Let S be the proposi- 
tion that there is a sheep in the field (that is, the field in front of x at 
the time in question). Is x’s knowing that Sin a normal situation so 
much less failable than his knowing that Sin a Gettier situation (the 
one described by Chisholm)? 

(i) If x knows in a normal way that S, then the close worlds that 
determine how failable that knowledge is are these ones: 


The close worlds where x has a true belief that S£ 


The close worlds where x believes that Sand x has good sensory evidence 
E for $” 


The close worlds where Sis true and x has good sensory evidence E for S. 


These are the various close accessible worlds—more specifically, the 
close worlds where at least two of the three traditional components 
of x’s knowing that S are present. And in each of those worlds, too, 
the third traditional component of x5 knowing that Sis present: 


In the close worlds where x has a true belief that S, x also has good 
sensory evidence È for $. 


In the close worlds where x believes that S and x has good sensory evi 
dence Efor S, Sis also true. 


In the close worlds where Sis true and x has good sensory evidence E 
for $ x also believes that S$. 


” Throughout my discussion of this case, I shall use ‘E’ to denote what is qual- 
tatively the same sensory evidence as x has in this world for S—which 1s to say that 
Es its appearing to x, on the basis of his looking at the field, that there is a sheep 
in that field in front of hım. 
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These regularities obtain because those worlds are as similar to this 
world as is compatible with their containing at least two members of 
the set {S’s being true, x’s believing that S, x’s having good sensory 
evidence Efor S$}, and because in this world there is, say, a normal 
kind of reliability underlying the fact of x’s having all three members 
of that set. Accordingly, in those other worlds also, all three mem- 
bers of the set are present. The result is that none of these close 
worlds is an epistemic-failure world for xin relation to S.” 

(i) If x knows that S within a Gettier situation, then the close 
worlds that determine how failable that knowledge is are these ones: 


The close worlds where x has a true belief that S, and where the strange 
occurrence from x’s Gettier situation also obtains. 


The close worlds where x believes that Sand x has good sensory evidence 
E for $, and where the strange occurrence from x’s Gettier situation also 
obtains. 


The close worlds where S is true and x has good sensory evidence E for 
S, and where the strange occurrence from x's Gettier situation also 
obtains. 


Let that strange occurrence be the fact that x is really only looking at 
a disguised dog (one which nevertheless seems to x to be a sheep).” 
Then the second of those three groups of worlds, at any rate, con- 
tains some epistemic-failure worlds for xin relation to $. Each world 
in that second group contains x’ believing that S and x’s having 
good sensory evidence E for S, where the presence of Eis due, how 
ever, to x’s looking at what is really a disguised dog (one which hap- 
pens to look to x exactly like a sheep); yet in some such worlds, S is 
not true. (In some of them, for example, the real sheep, which was 
out of x} sight anyway, has wandered out of the field for a minute or 
two, including the moment when x is looking at the field.) Those 
worlds are thus close epistemic-failure worlds for xin relation to p. 
But how close are they to this world——that is, to this world when it 
includes x’s knowing that S within a Gettier situation? In general 


* Obviously, there are epistemic-faulure worlds for xin relation to $. But given 
the normality of how—and therefore the stability with which—x u being sup- 
posed to know that Sın this world, those worlds are not close ones. 

* J am not building into my characterization of the strange occurrence the fact 
that there is a sheep elsewhere in the field. (On how “large” a strange occurrence 
needs to be, see footnote 12 above.) In any event, we could modify the criterion of 
closeness for the worlds we are discussng, so that they include either the strange 
occurrence orsome vital part of it. 
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(and from FailK3), x’s knowledge that Sis as failable as there are nu- 
merous and close epistemicfailure worlds for xin relation to S$. And 
(as we have seen just now) when x knows that S within a Gettier case 
that knowledge is at least more failable than x’s normal knowledge 
that S would be: there are no close epistemic-failure worlds for x in 
relation to $, when x is in a normal situation, and there are some 
such worlds when x is in a Gettier situation. Surely, we may say some- 
thing even stronger than that, though. Surely, we may say this: 


x's knowledge that S within the Gettier situation is not just more fail- 
able than his normal knowledge that S would be; it is much more fail 
able than his normal knowledge that $ would be. 


After all, even when x knows that S within the Gettier situation, this 
occurs within what is otherwise (we may assume) a normal world; 
and within such a world, a person would be unlikely to gain a true 
belief that $ by looking at a disguised dog and seeing it, quite reason- 
ably, as being a sheep (and by seeing no other animals in the field). 
Hence, while the normality of the world in general (when x knows 
that Sin a normal situation) makes all epistemic-failure worlds for x 
in relation to § distant ones, the normality of the world in general 
apart from the strange occurrence in x$ situation (when x knows 
that $ within a Gettier case) makes at least some epistemic-failure 
worlds for x in relation to S close ones. So, once more (and other 
things being equal), knowledge that p within a Gettier case is much 
more failable than knowledge that p within a normal situation. 
Where does all of that leave us? I am saying—nonstandardly, I re- 
alize, but coherently, I claim—that the strange occurrence in a help- 
ful Gettier case contributes to the epistemic subject's having 
knowledge, by helping to give him a welljustified true belief. By the 
same token, though, I am conceding that the knowledge is present 
only in a very fragile or finely balanced way. So, it is very failable. It 
is present but might easily not have been. The strange occurrence 
within a dangerous Gettier case, on the other hand, does not give 
the epistemic subject some failable knowledge. On the contrary: 
without the strange occurrence within a dangerous Gettier case, the 
epistemic subject’s welljustified true belief would be knowledge in a 
normal way. With the strange occurrence’s presence within a dan- 
gerous Gettier case, though, the knowledge is present only in a very 
fragile or finely balanced way. So, it is very failable. It is present but 
might easily not have been. Thus, what is true in this respect of 
helpful Gettier cases is no less true of dangerous Gettier cases. 
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I cannot prove that this is how Gettier cases must be interpreted. 
But I have done enough, I believe, to prove that the usual view of 
Gettier cases is not mandatory. It is standard—but not therefore 
true. And it is certainly not mandatory, given that we now have be- 
fore us a coherent and independently motivated way to interpret 
each Gettier case’s epistemic subject as having failable knowledge 
within his case.” 

(B) Gettier cases presented us with what was an epistemological 
puzzle only on the assumption that there can be failable knowledge.” 
And the equation of knowledge with well-justified true belief had 
seemed like the obvious account with which to understand that as- 
sumption. But as soon as some cases—specifically, the Gettier 
cases—were encountered where the epistemic subjects satisfied that 
account while almost not doing so, the account was taken to be 
faulty. Each Gettier case’s epistemic subject had a welljustified true 
belief—while almost not doing so. Hence, well-justified true belief 
was taken to be an inadequate account of knowing. Epistemologists 
were impressed by how easy it is to imagine changes to the circum- 
stances within a Getter case, changes which would have led to the 
case’s epistemic subject not having the welljustified true belief he 
actually has." They inferred that this shows that his having the 
welljustified true belief that p which he actually has is not enough 
for his knowing that p—and hence that knowledge is more than a 
welljustified true belief. 

But that did not follow at all. The fact that simple changes to the 
epistemic subject’s situation are possible, and that in the changed sit- 


” Dare I mention, too, how hard—or worse —1t has been for epistemologists to 
agree on an analyms of knowledge while also accepting, as they do, that in Getter 
cases there 1s a lack of knowledge? My speculation is that the various analyses are 
mistaken to at least the extent that the acceptance underlying them—the accep- 
tance that ın Gettier cases there 1s a lack of knowledge—1s mistaken. At any rate, 
other theoretical options should be formulated and discussed; I am offering one 
such option 

™ If knowledge is required to be tnfailablie, there is no independent puzzle 
about why there is no knowledge within Gettier cases The knowledge would be 
absent because the epistemic subject’s justification, say, is less than perfect; and 
that would be that Gettier cases are puzzling only if we believe that knowledge 
need not be infailable—for instance, it can be fallible (For then our best initial 
theory of such knowledge—which allows it to be fallibly-but-well-yusnfied true be- 
lief, say—is satisfied, and accordingly we should think that knowledge is present. 
Yet epistemologists have not thought that it ıs present. Thus, they have generated 
a puzzle for themselves.) 

= In helpful Gether cases, the change is the strange occurrence’s being absent 
(for example, the sheep's not being in the field). In dangerous Gettier cases, the 
change is the strange occurrence’s “carrying out” its threat (for example, Henry’s 
actually being misled by a fake barn). 
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uation he would not retain his welljustified true belief that p, does 
not entail that—while still within his actual, unchanged, 
situation—he does fail to know that p. It shows only that (and how) 
he would fail to know that p, if things were other than as they are in 
a particular respect. The fact that he would not have known that p 
does not entail that he does not know that p. Actual knowing does 
not entail counterfactual knowing. 

Admittedly, only a small alteration to the context is needed to 
bring about the imagined counterfactual situaton—and unarguably 
the epistemic subject does lack the knowledge within that altered sit- 
uation. Thus, the epistemic subject could easily fail to have that 
knowledge. And this might be thought to show that the alternative 
situation is similar enough to the actual situation to be epistemically 
relevant to the actual situation. It might be thought that therefore 
one’s knowing within the actual situation does depend on one’s 
knowing within the counterfactual situation—so that, because one 
fails to know in the latter situation, one also fails to know in the for- 
mer situation. It might be thought that the epistemic subject does 
fail to know, because he could easily fail to know. It might be 
thought that this is why there is a lack of knowledge in the actual 
situation —the Gettier situation. 

Well, that is the usual way of interpreting Gettier cases. But we 
have found another way to do so—and now I shall extend this alter- 
native interpretation a little. If we are to understand the nature of 
failable knowing, we could say instead (and this is my key observa- 
tion here) that the closeness of the alternative possible situation—in 
which the epistemic subject definitely does fail to know that p— entails 
only that the epistemic subject’s actual situation is almost one where 
he has that lack of knowledge. Because he could so easily fail to 
know, he does almost fail to know. His knowing is very failable, be- 
cause he almost fails to know. The closeness of the alternative possi- 
ble situation in which he does fail to know does not entail that the 
actual situation is one where there is a lack of knowledge. After all, 
in fact the small alteration to his actual situation has not occurred. 
And even if we feel that the person is just lucky that it did not occur, 
the fact remains that it did not occur. This is a fact which should be 
reflected in our assessment of the epistemic situation. It certainly 
plays a part in mine: 


Within each Gettier case, the epistemic subject knows, because he has a 
wellbjustified true belief. But he is very lucky to have all of that (and 
thereby the knowledge). This luck is a mark of how very failably he knows 
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within the Getter case. He does know—even though he almost failed to 
do so. He does know—even though he 1s very close to not doing so. 


Consider an analogous situation. Maybe there is luck involved in 
life’s ever having come to exist in this universe. There are so many 
possible—and slight—differences to the universe's initial conditions 
which, had they been actual, would not have led to the creation of 
life. So, by having the initial conditions it had, the universe was close 
to having different initial conditions—and thereby to not containing 
life. Nevertheless, it did not have those different initial conditions. 
Most of us do not say that consequently there are no natural laws 
within the universe. There are, in spite of the luck involved in their 
obtaining. And failable knowing within a Gettier case is like that. In 
each such case, the epistemic subject is very close to failing to know 
that p. He almost fails to know that p. And his being so close to fail 
ing to know that p is what misleads people into thinking that he does 
fail to know that p. Instead, we may say that his being so close to not 
knowing that p is simply part of his knowing very failably that p. 

(C) That interpretation of Gettier cases clashes with standard “in- 
tuitions” about the cases. But intuitions without developed theories 
to accompany them are blind; and as yet there is no deeply happy 
marriage between those intuitions and a theory of failable knowl- 
edge. (More limitedly, there is not even a union like that between 
those intuitions and theory of fallible knowledge.) What is more, we 
should wonder what underdeveloped theoretical commitments are 
lurking behind those “intuitions.” Epistemological reactions such as 
those “intuitions” are not givens. Might they be mistaken? Of 
course they might. Might they be based upon some false or mislead- 
ing theoretical views that are unreflectively being taken for granted 
by epistemologists? Of course they might. 

For example, epistemologists should ask themselves whether their 
so-called “intuitions” about Gettier cases might have relied implicitly 
upon the following thesis: 

KNAF: a person knows that p, only if it is not the case that he almost 

fails to know that p That is, knowing must not be a “near thing.” It 

must not be lucky. It must not almost not occur (if it is to occur). This 

is 80, even for failable knowledge. 


Undoubtedly, an adherence to KNAF would lead to the usual view of 
Gettier cases (because within a Gettier case the epistemic subject 
does at least come close to failing to know—and because, according 
to KNAF, that suffices for failing to know, which ts the usual view of 
what happens within a Gettier case). And it is not hard to imagine 
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how an epistemologist could be attracted to that thesis, KNAF. It 
would flow smoothly, if there was also an attraction to this thesis: 


What is not hard to achieve is not easy not to achieve. And knowledge 
is not hard to achieve. It is absent if, in the circumstances, it could eas- 
ily have been absent. 


Presumably, the idea behind such thinking is that there is something 
particularly “everyday”—and correlatively robust or resilient—about 
knowing. Such thinking might well seem plausible to many episte- 
mologists. 

Nevertheless, it is fundamentally mistaken thinking. Failable know- 
ing can be achieved either robustly or not, in an “everyday” way or not; 
either way, the result remains failable knowing. Indeed, presumably its 
being achieved less than robustly—in an unusual way—is more likely, 
given its failability; indeed again, that can be part of its failability. Know- 
ing can be at least somewhat lucky knowing— perhaps more so when it is 
failable knowing. It can be nonstandard or unusual knowing—again, 
perhaps more so when it is failable knowing. The usual reaction to 
Gettier cases, it seems to me, confuses failable knowing with failable 
and robust knowing. It confuses failable knowing with failable and 
wholly standard or usual knowing. The knowing inside a Gettier case is 
failable—but not failable and robust, or failable and wholly standard 
or usual. The fact of some knowing’s being at least somewhat lucky, 
though, does not entail that it is not knowledge.” 

In that way, it is open to us to view strange occurrences within Get- 
tier cases as mainly affecting the quality of the knowing. First and 
foremost, they are a mark of its failability—-even a mark of its marked 
failability. Within a Gettier case, the epistemic subject knows—but 
almost did not. Within a Gettier case, the knowledge is more failable 
than most of our failable knowledge. (That is because Gettier cases, 
unlike most situations in which we have failable knowledge, contain 
strange occurrences.) Epistemologists have misinterpreted Gettier 


™ Any temptation to deny that this is so should be tempered by a realizanon of 
how demanding a conception of knowledge is implied by such a denial. To insist 
that there can never be any luck in someone’s eee to be an ttfail- 
abilist and an infailabilist’s conception of knowl even more demanding 
than an infallibilist’s. From FalK, an infallibilist says ee knowledge’s fustificanon 
component entails the truth of its belief component-—which 1s to say that, if the 
belief component and the justification component are present, then the truth 
component has to be present, too. But an infailabilist says that, if any two of the 
three component are present, then the third one has to be present. So, an infalli- 
bilst concepti an pee upon knowledge only one of the three requirements 
placed upon knowledge by an infailabilist conception. An infalhbilist conception 
is thus much less demanding than an infailabilist one. 
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cases, in part by assuming that there is only one coherent way to inter- 
pret the cases. There has been a general desire to interpret knowing 
less demandingly (so that people can have knowledge comparatively 
easily) —while nevertheless also adopting what is really a more de- 
manding conception of knowing (so that people in Gettier cases lack 
knowledge). But that is simply an incoherent set of attitudes. One of 
them has to be discarded. I am outlining a way to do that, by inter- 
preting knowing less demandingly (and, correlatively, allowing that 
people in Getter cases do not lack knowledge). 

Is it too undemanding an interpretation, though? Does it imply, 
for instance, that we no longer need to have good justification at all 
for p if we are to know failably that p? In particular, does it imply 
that this is true of the very failable knowledge which, I have argued, 
is present within Gettier cases? Here is an objection to my interpre- 
tation, based upon the suspicion that it does have that implication: 


The interpretation in this paper claims that there is very failable knowl 
edge that p within Gettier cases—that there is knowledge present but 
only somewhat luckily. But the luck that is present within Gettier cases 

is a feature, first and foremost, of how the true belief that p 1s present 

And this suggests that knowledge need not include justification: if 

knowledge can be as failable as the interpretation claims it to be within 

Gettier cases, then knowledge can be a mere true belief—with the luck 

in the true beliefs presence constituting the failability of the knowl 

edge. The interpretation 1s thus congenial to relinquishing the tradi 
tional justification condition on knowing. 

But that objection to my interpretation fails. Within many a Get- 
tier case there is indeed some luck; and we might well wish to de- 
scribe it as there being some luck in the true belief’s having been 
gained. Such a description would be misleadingly brief, though. A 
fuller, and more accurate, description would say that the luck is a 
matter of the belief’s being true, given how the belief has been 
reached—where in fact it has been reached by the epistemic subject’s 
relying on his evidence. For example, let us suppose that we are in- 
clined to view Chisholm’s Sheep Case as one whose epistemic subject 
x is rather lucky to have a true belief that S (that is, the true belief 
that there is a sheep in the field). Of what, precisely, is that luck a 
featurer Is the luck simply a matter of S’s being luckily true, say? I 
do not think so: there is a sheep in the field, and perfectly normally 
so, with S’s truth as such thereby being no less normal. For the same 
reason, too, the luck is not strictly a matter of x’s belief that S being 
luckily true. Instead, the luck pertains to x’s having the true belief 
that Son the basis of his good sensory evidence E. There are differ- 
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ent ways in which we might try to explicate that accidentality. One 
popular way is to say that because E is misleading x into believing 
that there is a sheep in front of him (remember that E is being pro- 
duced within x by a dog who is disguised to look like a sheep), x is 
somewhat lucky then to gain the true belief that S on the basis of E 
Yet, however we explicate it, the fact remains that the luck within the 
case is a relationship—-one about whose more precise nature there 
ig epistemological debate—between (i) S’s being true and/or x’s be- 
lieving that S, and (ii) x’s relying on E in gaining that belief. So, 
even when a Gettier case’s epistemic subject has his true belief some- 
what luckily, and even if (as I have argued) that true belief is correla- 
tively failable knowledge, we need not infer from this that knowledge 
does not have to include a justification component. This is because 
the luck in the true belief’s presence within the Gettier case is only 
present insofar as the good evidence is also present and insofar as 
the true belief is based on that evidence. In other words, the luck is 
not really a feature of the trus bektef’s presence as such; it is a feature 
of the justified true beliefs presence. And thus the objection fails. 
Of course, its failure does not entail, in turn, that all knowledge does 
need to include justification. It does entail, though, that this paper’s 
interpretation of Gettier cases in particular does not entail that 
knowledge need not include justification.™ 

(D) Let us suppose that a chosen few of the brush strokes consti 
tuting the Mona Lasa were so integral to it that if they had been dift 
ferent, the painting would have suffered greatly. Call these the 
strokes. If they had been absent, the painting would have lacked the 
sensitivity and accuracy which (I assume) it has. Naturally, even now 
it is not a perfect painting. It is fine; yet it would not have been fine 
if the strokes had been absent. Moreover, they might easily have 
been absent, with Leonardo da Vinci having chosen between them 


* In saying this, I am not denying that there could be some knowledge that does 
not include justification; I am saying only that the knowledge within Getter cases is 
not hke that, in spite of that knowledge being as failable as it s. Recently, Crispin 
Sartwell has argued for the even stronger thess that knowledge is mere true belief 
(and hence, strictly speaking, never includes justification) — “Knowledge Is Merely 
True Belief,” Amencan Philosophical Quarter, xxv, 2 (April 1991): 157-65, and 
“Why Knowledge Is Merely True Belief,” this JOURNAL, LXXXIX, 4 (April 1992): 167- 
80. Elsewhere (in part of a manuscript, “Good Knowledge, Bad Knowledge”), I 
have rejected Sartwell’s pomton, while arguing that there could be knowledge 
which is unjustified. That argument is independent of the one here But if—on 
independent grounds—we conclude that there could be instances of knowl 
that do not include justification, then we may extend my analysis of failability cor 
relatively: the pertinent epistemic-failure worlds for those cases of unjustified 
knowledge would be ones where one and only one of the truth component and 
the belief component is present. 
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and some “competing” strokes, say. In fact, however, they were not 
absent. As a result, the picture is the fine portrait that it is. 

But imagine that initially the strokes had been absent, with “com- 
petitors” taking their place—and that da Vinci, bleary-eyed and 
tired, and barely aware of what he was doing, proceeded to replace 
the competitors with the strokes. The Mona Lisa as we know it was 
the product of his doing so. But his doing so also means the paint- 
ing’s having been produced in a way that is analogous to the produc- 
tion of a welljustified true belief within a Gettier case. (The strokes 
having been absent, and then all-but-accidentally being included af- 
ter all, is this case’s analogue of a strange occurrence within a Get- 
tier case— specifically, a helpful Gettier case.) The painting is 
accurate (which is like a beliefs being true); its accuracy is sensitively 
rendered (which is like a belief’s being justified); and a nontrivial 
kind of luck (which is like a strange occurrence’s being present) 
contributed to the painting’s ending up with that combination of ac- 
curacy and sensitivity. Because of the luck, the painting almost 
failed to be as fine as it is; even so, that is what it is—fine, very fine. 

An analogous moral pertains to knowing failably. A beliefs being 
produced, in part, by a strange occurrence such as is present in help- 
ful Gettier cases does not prevent its being failable knowledge (so long 
as the belief is true and well justified). We may interpret the strange 
occurrence, instead, as part of why the knowledge—the wellhjustified 
true belief—which it helps to produce is only failable. The belief’s 
nevertheless being knowledge is like the Mona Lisa's being a fine and 
accurate painting even if its combination of accuracy and sensitivity 
was to have been partly due to luck. The failability of the painting’s 
high quality would be partly due to the luck: it is a fine painting, but 
(in the imagined situation) it almost was not. 

(E) Sometimes, boldness is needed when claiming or attributing 
knowledge that is failable. My way of speaking about knowing makes 
clear why that is so: 


A person knows failably that p, if and only if he is at least a little close to 
not knowing that p. He knows very failably that p, if and only if he is 
very close to not knowing that p. (Equivalently: a person knows very 
failably that p, if and only if he almost fails to know that p.) 


To attribute failable knowledge is to attribute knowledge that might 
have been absent. The more failable it is, the less confident we 
might be that itis knowledge. But this should not worry us; in partic- 
ular, it should not lead to our precipitately assuming that knowledge 
that p is absent whenever our confidence in its being present begins 
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to ebb away (such as occurs with Gettier cases). We need to keep in 
mind that—other things being equal—the more failable a piece of 
knowing is, the less confident we should be that it is knowledge. 

That might sound mistaken: “Surely, the more failable a piece of 
knowledge is, the more confident we should be that it is knowledge. 
After all, the more failable it is, the lower a justificatory standard 
needs to be satisfied by it, for example.” Not so (I say): the less fail- 
able a case of knowing is, the further away it is from not being knowl- 
edge; accordingly, the more confident we should be that it is 
knowledge. If one knows tnfailably that p, one is maximally far away 
from not knowing that p. If ever one should be confident that one 
knows that p, it is in this case.™ 

Naturally or conventionally (I shall not hazard a view as to which), 
knowing has boundaries. Definitionally, the lower boundary of fail 
able knowing divides (i) not knowing while almost knowing, from (ii) 
knowing while almost not knowing. Why should we not also 
distinguish —within the category of failable knowing-——between (ii) 
knowing while almost not knowing, and (iii) knowing while not even 
almost not knowing? Part of my thesis is that we should be on the 
alert for instances of that distinction. And the distinction between (a) 
the epistemic subject’s well-justified true belief within a Gettier case, 
and (b) most normal failable instances of knowing, is such an in- 
stance: (a) stands to (b) as (ii) stands to (ili). Moreover, why should 
we not describe cases of (ii) as being cases of more failable knowing 
than are cases of (ili) —cases of knowing which are closer to not being 
cases of knowing? Part of my thesis is that we should do so, since it 
marks a useful way of reflecting more carefully on various people’s 
epistemic situations. The knowledge within a Gettier case, say, is more 
failable than most. When one is in a Gettier case, one is very close to 
not knowing. That being so, it is not surprising that beliefs within 
Gettier cases have been mistaken for instances of not knowing. When 
working at the boundary of a concept, mistakes occur easily; this is 
why philosophical counterexamples can be puzzling. Gettier cases 
are a perfect example of this—of philosophical counterexamples 
that have generated puzzlement in that way. In particular, they are 
cases that test the lower boundary of the concept of failable knowing. 


™ What one n entitled not to be so confident about in that event is the infailabil- 
ity as such of one’s infailable know! that p To be confident that some knowl 
edge that p is infailable, as agamst utinfailable, 1s not as warranted as is the 
fact that, ether way, one knows that ~—1n short, that one knows simpliciter that p 
(We shall with this only if we believe that knowledge must be infailable, 
hence tf we believe that all knowledge simpliciter ts infailable knowledge. But that 
1s not a belief I am modeling here.) 
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The problem has been that epistemologists——through the mistaken 
kind of caution discussed in subsection (C), I surmise— have placed 
the cases on the wrong side of that boundary. Stll, this was almost 
not a mistake by epistemologists. We can say that much for it, I sup- 
pose. A careful initial reaction to Gettier cases would be to say that 
their epistemic subjects either lack knowledge or have it but only just. 
Some knowing has to be at the border, between knowing and not 
knowing— that is, some knowing is almost not knowing. It would be 
easy to mistake such knowing for not knowing. And this is precisely 
what, it seems to me, epistemologists have done in their responses to 
Gettier cases. At the very least, therefore, epistemologists should abam 
don their customary confident belief that in Gettier cases knowledge is 
absent—and that this is the only reasonable interpretation of the 
cases. It is not 
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BARGAINING WITH NEIGHBORS: IS JUSTICE CONTAGIOUS? 


hat is justice? The question is harder to answer in some 
cases than in others. We focus on the easiest case of dis- 
tributive justice. Two individuals are to decide how to dis- 
tribute a windfall of a certain amount of money. Neither is 
especially entitled, or especially needy, or especially anything—their 
positions are entirely symmetric. Their utilities derived from the dis- 
tribution may be taken, for all intents and purposes, simply as the 
amount of money received. If they cannot decide, the money re- 
mains undistributed and neither gets any. The essence of the situa- 
tion is captured in the simplest version of a bargaining game devised 
by John Nash.’ Each person decides on a bottom-line demand. If 
those demands do not jointly exceed the windfall, then each person 
gets his demand; if not, no one gets anything. This game is often 
simply called dsuide-the-dollar. 
In the ideal simple case, the question of distributive justice can be 
decided by two principles: 


Opismahty. a distribution 1s not just if, under an alternative distnbution, 
all recipients would be better off. 

Equity: if the position of the recipients is symmetric, then the distribu- 
tion should be symmetric. That is to say, it does not vary when we 
switch the recipients. 

Since we stipulate that the position of the two individuals is symmet- 
ric, equity requires that the just distribution must give them the 
same amount of money. Optimality then rules out such unlikely 
schemes as giving each one dime and throwing the rest away—each 
must get half the money. 

There is nothing new about our two principles. Equity is the sim- 
plest consequence of the theory of distributive justice in Aristotle’s 
Polktics. It is a consequence of Immanuel Kant’s categorical impera- 
tive. Utilitarians tend to stress optimality, but are not completely in- 
sensitive to equity. Optimality and equity are the two most 
uncontroversial requirements in Nash’s axiomatic treatment of bar- 
gaining. If you ask people to judge the just distribution, their an- 
swers show that optimality and equity are powerful operative 
principles.* So, although nothing much hangs on it, we shall feel 


' “The Bargaining Problem,” Econometrica, xvi (1950): 155-62. 
* Menachem Yaan and Maya Bar-Hillel, “On Dividing Justly,” Social Chore and 
Welfare, 1 (1981): 1-24. 
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free to use moral language and to call the equal split fatr drvision in 
divide-the-dollar. 
L RATIONALITY, BEHAVIOR, EVOLUTION 

Two rational agents play the divide-the-dollar game. Their rational 
ity is common knowledge. What do they do? The answer that game 
theory gives us is that any combination of demands is compatible with 
these assumptions. For example, Jack may demand ninety percent 
thinking that fill will only demand ten percent on the assumption 
that Jill thinks that Jack will demand ninety percent and so forth, 
while Jill demands seventy-five percent thinking that Jack will de- 
mand twenty-five percent on the assumption that Jack thinks that Jill 
will demand seventy-five percent and so forth. Any pair of demands 
is ratonalzable, in that it can be supported by a hierarchy of conjec- 
tures for each player, compatible with common knowledge of ration- 
ality. In the example given, these conjectures are quite mistaken. 

Suppose we add the assumption that each agent somehow knows 
what the other will demand. Then any combination of demands 
that total the whole sum to be divided is still possible. For example, 
suppose that Jack demands ninety percent knowing that Jill will de- 
mand ten percent and fill demands ten percent knowing that Jack 
will demand ninety percent. Then each player is maximizing payoff 
given the demand of the other. That is to say that this is a Nash 
equilibrium of divide-the-dollar. If the dollar is infinitely divisible, 
then there are an infinite number of such equilibria. 

If experimental game theorists have people actually play divide- 
the-dollar, they always split equally.’ This is not always true in more 
complicated bargaining experiments where there are salient asym- 
metries, but it is true in divide-the-dollar. Rational-choice theory has 
no explanation of this phenomenon. It appears that the experimen- 
tal subjects are using norms of justice to select a particular Nash 
equilibrium of the game. But what account can we give for the exis- 
tence of these norms? 

Evolutionary game theory (reading ‘evolution’ as cultural evolu- 
tion) promises an explanation, but the promise is only partially ful 
filled. Demand-half is the only evolutionarily stable strategy in 


> Rudy V Nydegger and Guillermo Owen, *Two-Person Bargaining: An Expen- 
mental Test of the Nash Axioms,” International Journal of Game Theory, m (1974): 
239-50; Alvin Roth and Michael Malouf, “Game Theoretic Models and the Role of 
Information in Bargaining,” Psychological Revwew, LXXXVI (1979): 574-94; John Van 
Huyck, Raymond Batallio, Sondip Mathur, Patsy Van Huyck, and Andreas Ort- 
mann, “On the Origin of Convention: Evidence From Symmetric Bargaining 
Games,” International Journal of Game Theory, xxiv (1995) 187212. 
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divide-the-dollar.* It is the only strategy such that, if the whole pop- 
ulation played that strategy, no small group of innovators, or “mu- 
tants,” playing a different strategy could achieve an average payoff at 
least as great as the natives. If we could be sure that this unique evo- 
lutionarily stable strategy would always take over the population, the 
problem would be solved. 

But we cannot be sure that this will happen. There are states of 
the population which are evolutionarily stable where some fraction 
of the population makes one demand and some fraction makes an- 
other. The state where half the population demands one third and 
half the population demands two thirds is such an evolutionarily sta- 
ble polymorphism of the population. So is the state where two thirds 
of the population demands forty percent and one third of the popu- 
lation demands sixty percent We can think of these as pitfalls along 
the evolutionary road to justice. 

How important are these polymorphisms? To what extent do they 
compromise the evolutionary explanation of the egalitarian norm? 
We cannot begin to answer these questions without explicitly model 
ing the evolutionary dynamics and investigating the size of their 
basins of attraction. 

Il. BARGAINING WITH STRANGERS 
The most widely studied dynamic evolutionary model is a model of 
interactions with strangers. Suppose that individuals are paired at 
random from a very large population to play the bargaining game. 
We assume that the probability of meeting a strategy can be taken as 
the proportion of the population that has that strategy. The popula- 
tion proportions evolve according to the replicator dynamics. The 
proportion of the population using a strategy in the next generation 
is the proportion playing that strategy in the current generation mw 
tiplied by a fitness factor. This fitness factor is just the ratio of the av 
erage payoff to this strategy to the average payoff in the whole 
population.’ Strategies that do better than average grow; those 
which do worse than average shrink. This dynamic arose in biology 


t Robert Sugden, The Economics of Rights, Cooperation, and Walfare (New York: 
Blackwell, 1986). 

* This is the discrete tme version of the replicator dynamics, which is most rele- 
vant in comparison to the alternative bargarnng-aath-naghbors dynamics conndered 
here. There is also a continuous time version. As comprehensive references, see 
Josef Hofbauer and Karl Sigmund, The Theory of Evolution and Dynamical Systems 
(New York: Cambridge, 1988); Jorgen W. Wei Evolutionary Games Theory (Car 
bridge: MIT, 1995); Larry Samuelson, Evokdonary Games and Equihbrrum Selection 
(Cambndge: MIT, 19977). 
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as a model of asexual reproduction, but more to the point here, it 
also has a cultural evolutionary interpretation where strategies are 
imitated in proportion to their success.* 

The basins of attraction of these polymorphic pitfalls are not negligi 
ble. A realistic version of divide-the-dollar will have some finite num- 
ber of strategies instead of the infinite number that we get from the 
idealization of infinite divisibility. For a finite number of strategies, the 
size of a basin of attraction of a population state makes straightforward 
sense. It can be estimated by computer simulations. We can consider 
coarse-grained or fine-grained versions of divide-the-dollar; we can di 
vide a stack of quarters, or of dimes, or of pennies. Some results of sim- 
ulations persist across a range of different granularities. Equal division 
always has the largest basin of attraction and it is always greater than 
the basins of attractions of all the polymorphic pitfalls combined. If 
you choose an initial population state at random, it is more probable 
than not that the replicator dynamics will converge to a state of fixation 
of demand-half. Simulation results range between fifty-seven and sixty- 
three percent of the initial points going to fair division. The next 
largest basin of attraction is always that closest to the equal split: for ex- 
ample, the four-six polymorphism in the case of dividing a stack of ten 
dimes and the forty-nine/fifty-one polymorphism in the case of divid- 
ing a stack of one-hundred pennies. The rest of the polymorphic equi 
libria follow the general rule—the closer to fair division, the larger the 
basin of attraction. 

For example, the results running the discrete replicator dynamics 
to convergence and repeating the process 100,000 times on the 
game of dividing ten dimes are given in table 1. 





Table 1: Convergence results for replicator dynamscs - 100,000 trials 


* Jonas BySrnerstedt and Jorgen Weibull, “Nash Equilibnum and Evolution by 
Imitahon,” in Kenneth J. Arrow et aha, eds, The Rational Foundations of Economic Be 
havior (New York: Macmillan, 1996), pp. 155-71, Kari Schlag, “Why Imitate, and If 
So How?” Discussion Paper B-361 (University of Bonn, Germany, 1996). 
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The projected evolutionary explanation seems to fall somewhat 
short. The best we might say on the basis of pure replicator dynam- 
ics is that fixation of fair division is more likely than not, and that 
polymorphisms far from fair division are quite unlikely. 

We can say something more if we inject a little bit of probability 
into the model. Suppose that every once and a while a member of 
the population just picks a strategy at random and tries it out—per- 
haps as an experiment, perhaps just as a mistake. Suppose we are ata 
polymorphic equilibrium—for instance, the four-six equilibrium in 
the problem of dividing ten dimes. If there is some fixed probability 
of an experiment (or mistake), and if experiments are independent, 
and if we wait long enough, there will be enough experiments of the 
right kind to kick the population out of the basin of attraction of the 
four-six polymorphism and into the basin of attraction of fair division 
and the evolutionary dynamics will carry fair division to fixation. 
Eventually, experiments or mistakes will kick the population out of 
the basin of attraction of fair division, but we should expect to wait 
much longer for this to happen. In the long run, the system will 
spend most of its time in the fairdivision equilibrium. Peyton Young’ 
showed that, if we take the limit as the probability of someone experi 
menting gets smaller and smaller, the ratio of time spent in fair divt 
sion approaches one. In his terminology, fair division is the 
stochastically stable equilibrium of this bargaining game. 

This explanation gets us a probability arbitrarily close to one of 
finding a fair-division equilibrium if we are willing to wait an arbitrar- 
ily long time. But one may well be dissatisfied with an explanation 
that lives at infinity. (Putting the limiting analysis to one side, pick 
some plausible probability of experimentation or mistake and ask 
yourself how long you would expect it to take in a population of 
10,000, for 1,334 demand-six types simultaneously to try out being 
demand-five types and thus kick the population out of the basin of 
attraction of the four-six polymorphism and into the basin of attrac- 
tion of fair division.*) The evolutionary explanation still seems less 
than compelling. 


’ “An Evolutionary Model of Bargaining,” Journal of Economic Theory, LIX (1998): 
145-68, and “The Evolution of Conventions,” Econometrica, LXI (1998): 57-94; and 
Dean Foster and Young, “Stochastic Evolutionary Game Dynamics,” Theoretical Pop- 
ulation Biology, xxxvill (1990): 219-32. 

* For discussion of expected waiting times, see Glenn Ellison, “Learning, Local 
Interaction and Coordination,” Econometrica, LXI (1993): 1047-71; and Robert Axtell, 
Joshua M. Epstein, and H. Peyton Young, “The Emergence of Economic Classes in 
an Agent-Based Bargaining Model,” preprint (Brookings Institunon, 1999). 
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I BARGAINING WITH NEIGHBORS 

The model of random encounters in an infinite population that mo- 
tivates the replicator dynamics may not be the right model. Suppose 
interactions are with neighbors. Some investigations of cellular au- 
tomaton models of prisoner’s dilemma and a few other games show 
that interactions with neighbors may produce dynamical behavior 
quite different from that generated by interactions with strangers.’ 
Bargaining games with neighbors have not, to the best of our knowl 
edge, previously been studied. 

Here, we investigate a population of 10,000 arranged on a one hum 
dred by one hundred square lattice. As the neighbors of an individual 
in the interior of the lattice, we take the eight individuals to the N, NE, 
E, SE, S, SW, W, NW. This is called the Moore(8) neighborhood in 
the cellular automaton literature.” The dynamics is driven by imita- 
tion. Individuals imitate the most successful person in the neighbor- 
hood. A generation—an iteration of the discrete dynamics—has two 
stages. First, each individual plays the divide-ten-dimes game with 
each of her neighbors using her current strategy. Summing the pay 
offs gives her current success level. Then each player looks around 
her neighborhood and changes her current strategy by imitating her 
most successful neighbor, providing that her most successful neighbor 
is more successful than she is; otherwise, she does not switch strategies. 
(Ties are broken by a coin flip.) 

In initial trials of this model, fair division always went to fixation. 
This cannot be a universal law, since you can design “rigged” config- 
urations where a few demand-one-half players are, for example, 
placed in a population of demand-four and demand-six players with 
the latter so arranged that there is a demand-six type who is the most 
successful player in the neighborhood of every demand-one-half 
player. Start enough simulations at random starting points and 
sooner or later you will start at one of these. 

We ran a large simulation starting repeatedly at randomly chosen 
starting points. Fair division went to fixation in more than ninety-nine 
point five percent of the trials. The cases where it did not were all cases 
where the initial population of 10,000 contained fewer than seventeen 


* Gregory B. Pollack, “Evolutionary Stability on a Viscous Lattice,” Soctal 
Networks, xı (1989): 175-212; Martin A. Nowak and Robert M. May, “Evolutionary 
Games and Spatial Chaos,” Nature, cociix (1992): 826-29, Kristian Lindgren and 
Mats Nordahl, “Evolutionary Dynamics in Spanal Games,” Physica D, LXXV (1994): 
292-309, Luca Anderlini and Antonella Ianm, “Learning on a Torus,” in Cristina 
Bicchieri, Richard Jeffrey, and Bran Skryms, eds., The Dynamacs of Norms (New 
York: Cambridge, 1997), pp. 87-107. 

* We find that behavior is not much different if we use the von Neumann 
neighborhood: N, S, E, W, or a larger Moore neighborhood. 
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demand-one-half players. Furthermore, convergence was remarkably 
quick. Mean time to fixation of fair division was about sixteen genera- 
tions. This may be compared with a mean time to convergence” in dis- 
crete replicator dynamics of forty-six generations, and with the 
ultralongrun character of stochastically stable equilibrium. 

It is possible to exclude fair division from the possible initial strate- 
gies in the divide-ten-dimes game and start at random starting points 
that include the rest. If we do this, all strategies other than demand- 
four dimes and demand-six dimes are eliminated and the four-six 
polymorphic population falls into a “blinking” cycle of period two. 
If we then turn on a little bit of random experimentation or “muta 
tion” allowing the possibility of demand-five, we find that as soon asa 
very small clump of demand-five players arises, it systematically grows 
until it takes over the whole population—as illustrated in figure 1. 
fustice ts contagious.” 
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Figure 1: The steady advance of fair division 





IV ROBUSTNESS 
The bargaining-with-neighbors model of the last section differs from 
the bargaining with strangers model in more than one way. Might 
the difference in behavior that we have just described be due to the 
imitate-the-most-successful dynamics rather than the neighbor 
effect? To answer this question, we ran simulations varying these fac- 
tors independently. 

We consider both fixed and random neighborhoods. The models 
using fixed neighborhoods use the Moore (8) neighborhood de- 
scribed above. In the alternative random-neighborhood model, 
each generation a new set of “neighbors” is chosen at random from 


1! At 9999 level to keep things comparable. 
3 Ellison (op. at) found such contagion effects in local interaction of players 


arranged on a circle and playing pure coordination games 
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the population for each individual. That is to say, these are neigh- 
borhoods of strangers. 

We investigated two alternative dynamics. One imitates the most 
successful neighbor as in our bargaining-with-neighbors model. The 
other tempers the all-or-nothing character of imitate-the-best. Un- 
der it, an individual imitates one of the strategies in its neighbor- 
hood that is more successful than it (if there are any) with relative 
probability proportional to their success in the neighborhood. This 
is a move in the direction of the replicator dynamics. 





Table 2: Convergence results for five series of 10,000 mals 


In table 2, A and B are bargaining with neighbors, with imitate-the- 
best-neighbor and imitate-with-probability-proportional-to-success 
dynamics, respectively. The results are barely distinguishable. Cand 
D are the random-neighborhood models corresponding to A and B, 
respectively. These results are much closer to those given for the 
replicator dynamics in table 1. The dramatic difference in conver- 
gence to fair division between our two models is due to the structure 


of interaction with neighbors. 
V ANALYSIS 


Why is justice contagious? A strategy is contagious if an initial 
“patch” of that strategy will extend to larger and larger patches. The 
key to contagion of a strategy is interaction along the edges of the 
patch, since in the interior the strategy can only imitate itself.” 


* For this reason, “frontier advantage” is used to define an unbeatable strategy in 
Man Eshel, Emiha Sansone, and Avner Shaked, “Evolutionary Dynamics of Popula 
tons with a Local Interaction Structure,” working paper (Univernty of Bonn, 1996). 
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Consider an edge with demand-five players on one side, and play- 
ers playing the complementary strategies of one of the polymor- 
phisms on the other. Since the second rank of demand-five players 
always meet their own kind, they each get a total payoff of forty from 
their eight neighbors. Players in the first rank will therefore imitate 
them unless a neighbor from the polymorphism gets a higher payoff. 
The low strategy in a polymorphic pair cannot get a higher payoff. 
So if demand_-five is to be replaced at all, it must be by the high strat- 
egy of one of the polymorphic pairs. 

In the four-six polymorphism—the polymorphism with the great- 
est basin of attraction in the replicator dynamics—this simply cannot 
happen, even in the most favorable circumstances. Suppose that we 
have someone playing demand-six in the first rank of the polymor- 
phism, surrounded on his own side by compatible demand-four 
players to boost his payoff to the maximum possible.“ Since he is in 
the first rank, he faces three incompatible demand-five neighbors. 
He has a total payoff of thirty while his demand-five neighbors have a 
total payoff of thirty-five. Demand-five begins an inexorable march 
forward as illustrated in figure 2. (The pattern is assumed to extend 
in all directions for the computation of payoffs of players at the pe- 
riphery of what is shown in the figure.) 


Initial Iteration 1 
5544 5354 
5344 5554 
5564 Ea S5534 
5544 $554 
5544 5554 


Figure 2: fair divinon versus four-six polymorphism 


4 In situating the high strategy of the polymorphic pair in a sea of low-strategy 
players, we are creating the best-case scenario for the advancement of the polymor- 
phum into the patch of demand-five players. 
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If we choose a polymorphism that is more extreme, however, it is 
possible for the high strategy to replace some demand-five players 
for a while. Consider the one-nine polymorphism, with a front line 
demand-nine player backed by compatible demand-one neighbors. 
The demand-nine player gets a total payoff of forty-five—-more than 
anyone else—and thus is imitated by all his neighbors. This is 
shown in the first transition in figure 3. 


Initial Iteration 1 Iteration 2 Iteration 3 
S51 11 55511 55551 55555 
55711 55571 55559 55559 
55111 59991 55999 55599 
55911 5999] 55999 55599 
55111 59991 55999 55599 
55711 5551] 55559 55559 
55111 55511 5555] 55555 


Figure 8: fair diviaion versus one-nine polymorphism 


But the success of the demand-nine strategy is its own undoing. Ina 
cluster of demand-nine strategies, it meets itself too often and does 
not do so well. In the second transition, demand-five has more than 
regained its lost territory, and in the third transition it has solidly ad- 
vanced into one-nine territory. 

Analysis of the interaction along an edge between demand-five 
and other polymorphisms is similar to one of the cases analyzed 
here.“ Either the polymorphism cannot advance at all, or the ad- 
vance creates the conditions for its immediate reversal. A complete 
analysis of this complex system is something that we cannot offer. 
But the foregoing does offer some analytic insight into the conta- 
gious dynamics of equal division in “bargaining with neighbors.” 

VI. CONCLUSION 
Sometimes we bargain with neighbors, sometimes with strangers. The 
dynamics of the two sorts of interaction are quite different. In the bar- 
gaining game considered here, bargaining with strangers—modeled by 
the replicator dynamics—-leads to fair division from a randomly chosen 
starting point about sixty percent of the time. Fair division becomes 


the unique answer in bargaining with strangers if we change the ques- 


* With some minor complications involving tes. 
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tion to that of stochastic stability in the ultra-long-run. But long ex- 
pected waiting times call the explanatory significance of the stochastic 
stability result into question. 

Bargaining with neighbors almost always converges to fair division 
and convergence is remarkably rapid. In bargaining with neighbors, 
the local interaction generates clusters of those strategies which are 
locally successful. Clustering and local interaction together produce 
positive correlation between like strategies. As noted elsewhere,” 
positive correlation favors fair division over the polymorphisms. In 
bargaining with neighbors, this positive correlation is not something 
externally imposed but rather an unavoidable consequence of the 
dynamics of local interaction. As a consequence, once a small group 
demand-half players is formed, justice becomes contagious and 
rapidly takes over the entire population. 

Both bargaining with strangers and bargaining with neighbors are 
artificial abstractions. In initial phases of human cultural evolution, 
bargaining with neighbors may be a closer approximation to the ac- 
tual situation than bargaining with strangers. The dynamics of bar- 
gaining with neighbors strengthens the evolutionary explanation of 
the norm of fair division. 

JASON ALEXANDER 
BRIAN SKYRMS 
University of California/Irvine 


8 Skyrma, “Sex and Justice,” this JOURNAL, XCI, 6 (June 1994): 305-20 and Euotu- 
tion of the Sonal Contract (New York: Cambridge, 1996). 
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THE MORAL BASIS OF POLITICAL LIBERALISM* 


he exchange of essays that Jurgen Habermas and John Rawls 

published several years ago in this JOURNAL was a long-awaited 

event.’ At last, the two greatest political philosophers of our 
time turned to comment directly on each other’s conceptions of po- 
litical legitimacy and social justice, the nature of citizenship, and the 
goals of political argument. We now know what Habermas and 
Rawls each believe, rightly or wrongly, are the strengths and weak- 
nesses of the other’s philosophy. On the whole, Habermas sought to 
accentuate their differences, while Rawis, though noting significant 
disagreements, tried to underscore the common elements in their 
views. I believe that in reality the two thinkers are far closer to one 
another than Habermas realizes, but that their proximity is not of 
the form that Rawls imagines. To me it seems that their fundamen- 
tal point of convergence involves a common failing. Each of them, 
aiming at a similarly “freestanding” or “autonomous” conception of 
political life, misses its underlying moral foundations. 

In Political Liberalism, Rawls? declares that a political conception is free- 
standing, if it looks only to the principles that should govern the polit- 
cal life of society. It does not present itself as applying to the political 
realm a comprehensive doctrine about the ends of life (PL 12). Noth- 
ing, of course, prevents such a political conception from being inte- 


* [ thank Daniel Brudney, David Estlund, Alessandro Ferrara, Rainer Forst, and 
Andrew Wallace, as well as the members of my seminar at the University of 
Chicago, for their help on earher versions of this paper 

! Habermas, “Reconciliation through the Public Use of Reason. Remarks on 
John Rawis’s Pohitcal Liberalism,” this JOURNAL, xc, 3 (March 1995): 109-31; and 
Rawis, “Reply to Habermas,” this JOURNAL, xon, 3 (March 1995): 132-80. Subse- 
quent references to these essays will be given in the text, with the page number 
preceded by the abbreviation ‘JP’. 

* New York: Columbia, 1993. References to this book will jj given in the text, 
with the page number preceded by the abbremation PL” í 
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grated into an encompassing moral, religious, or metaphysical view of 
the world. Indeed, Rawls believes that his poktcal kberalism, aiming as it 
does to formulate liberal principles in this freestanding way, is not fully 
Justified unless it forms a common part or “overlapping consensus” of 
the different visions of the good life which are prominent in modern 
society. But at the same time, the basic principles on which political as- 
sociation relies must be ones that can be framed and legitimated with- 
out appeal to those broader considerations. 

I share Rawls’s conviction that a liberal conception of political asso- 
ciation should be freestanding in this sense. For liberal thought is 
best understood, I believe, as responding to an essential ingredient of 
our modern selfinderstanding. On fundamental issues about the 
meaning of life, we have come to expect that reasonable people tend 
naturally to disagree with one another. In early modern times, in the 
course of a century of religious wars, many people came to the 
painful realization that even with the best will in the world they would 
continue to differ about the nature and obligations of the true faith. 
Since then, this conviction has widened in scope. In a free and open 
discussion about the fulfilled life, the human good, or the nature of 
self-realization, it seems that the more we talk, the more we disagree 
(sometimes even with ourselves). On these matters, being reason- 
able—that is, thinking and conversing in good faith and applying, as 
best as we can, the general capacities of reason that pertain to every 
domain of inquiry—tends not to produce agreement but to spark 
controversy. Taking this experience to heart, liberal thinkers have 
concluded that political association should no longer undertake to 
express and foster a conception of the ultimate ends of human exis- 
tence. Instead, it must seek its principles in a minimal morality, 
which reasonable people can share despite their expectably divergent 
religious and ethical convictions. Only so can the political world, gow 
erned as it must be by coercive rules, sanctioned by state power, come 
to more than just the rule of force. Only so can it enjoy the sort of 
transparency in which citizens recognize their political principles as 
the expression of their own will. 

Central though it is to modern experience, the phenomenon of rea- 
sonable disagreement is not easy to explain, for it runs contrary to one 
of the deepest preconceptions of our philosophical tradition. Why 
should reason, on questions of supreme importance, work not to bring 
us together but to drive us apart? Historical contingencies surely play 
an important role. There is the great variety of life experiences created 
by modern Western society, with all its complex divisions of labor and 
rich heritage of many cultural traditions. No doubt, too, the liberal 
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practices of toleration and public discussion have fostered this social 
and cultural heterogeneity, multiplying the very circumstances to 
which they respond. But we might also wonder whether reason does 
not naturally breed discord on difficult questions. To the extent that 
earlier societies achieved agreement about the good, or seem to have 
done so, does this not reflect the limits they placed on open discussionr 
And though the modern sciences do display widespread convergence 
of opinion about complex matters, might this not be because they sub- 
ject the observation of nature to forms of reasoning, based on com- 
trolled experiment and measurement, which are geared precisely 
toward making agreement achievable? 

Explaining the disharmony of reason is bound to be a speculative 
endeavor. Drawing out its consequences, particularly in the political 
realm, is more straightforward. In general, we may say, the aim of 
political order has always involved restraining the struggle for advan- 
tage and the violence of emotion for the sake of achieving a com- 
mon good. In the past, however, this common good was typically 
understood as involving a comprehensive vision of human purposes, 
a vision which belonged to a society’s most treasured knowledge, in- 
terpreted and propagated by a clerisy (often a priestly class) and 
sanctioned by state authority. As the modern appreciation of reason- 
able disagreement has grown, the fundamental tasks of political asso- 
ciation have come to appear more complex. No longer are interest 
and passion alone the chief objects of concern, for the proliferation 
of reasonable views of the good life represents a political problem in 
its own right. Although people may be looking to their own interest 
or may be carried away by the passions of conviction when they wish 
to impose on others their conception of the good, they may also be 
acting in the disinterested belief that political life would be better if 
organized along its lines. It has become a distinctively modern ques- 
tion to ask on what basis reasonable people, thus divided, can live to- 
gether nonetheless in political community. 

I should point out that in referring as I have to reasonableness, I do 
not mean quite what Rawls himself does by this term. He means by it 
the moral commitment to seek and to abide by fair principles of coop- 
eration, particularly in view of the extent to which people tend natu 
rally to disagree in their comprehensive conceptions of human 
flourishing; thus he distinguishes the reasonable from the rational, the 
latter designating the intelligent pursuit of one’s own ends, apart from 
considerations of fairness (PL 48-58). Here, as in previous writings,’ I 


* Asin my book, The Morals of Modernity (New York: Cambridge, 1996) 
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use the term more abstractly to mean the free and open exercise of 
the basic capacities of reason. Reasonable people according to my us- 
age are therefore rational, but they may be reasonable in Rawls’s sense as 
well, if they are concerned to reason about how best to be fair (though 
their commitment to fairness will not follow from being reasonable 
alone). It follows that reasonableness for Rawls denotes a moral re- 
sponse to the predicament, the profusion of disagreement about the 
human good, that arises from the more inclusive disposition I call be- 
ing reasonable. In itself, the difference is only terminological. I do not 
deny that only in virtue of certain moral principles can liberalism give 
the weight it does to what I have defined as reasonable disagreement. 
So far am I from denying it that this moral basis forms the principal 
subject of my essay. 

Liberalism, I have said, entails focusing on what reasonable people 
can still share, despite their differences about the good life. But we 
would badly misunderstand its nature, if we supposed its guiding 
principles to consist simply in whatever might turn out to be com 
mon ground among reasonable people otherwise divided by their 
convictions. More fundamental than the political principles on 
which they will agree is the very commitment to organize political 
life along these lines, to seek principles that can be the object of rea- 
sonable agreement. This commitment forms the moral core of lib- 
eral thought, and it embodies a principle of respect for persons.* 
Such is the main point I want to establish, as I look at the recent ex- 
change between Habermas and Rawls and at the central question of 
their debate: What can it mean for political association to be 
founded on freestanding principles designed to abstract from the 
ongoing disagreements about the nature of the human good? 

L CLASSICAL AND POLITICAL LIBERALISM 
First, some historical background is necessary. We need to under- 
stand why this question should assume the greatest importance for 
liberal thought today. 

From John Locke’s time to our own, liberal thinkers have generally 
presented their political philosophy in terms of a full-scale individual- 
ism, urging a critical detachment toward inherited forms of belief and 


* Despite its philosophical salience, this moral commutment does not, I believe, 
provide a sufficient basis for a liberal polincal order. It cannot explain why people 
who hold this commitment should think of themselves as a people, bound together 
in a common political destiny which distinguishes them from others. This element 
in their self-understanding arises from a shared historical experience, particularly 
the memory of past conflicts which were fueled by the effort to umpose faiths or 
other comprehensive ideals on one another. For some more on this point, see my 


Morals of Modernity, pp. 141-44. 
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cultural traditions. This is not surprising. Individualism has formed a 
pervasive current in our culture. It grows out of basic features of mod- 
em society, particularly the market institutions of a capitalist economy. 
It has also drawn encouragement from the modern experience of rea 
sonable disagreement, as people have concluded that they must there- 
fore determine on their own their vision of the human good; and that 
conclusion has seemed to define the basis on which they must under- 
stand the terms of their collective life. Thus, the classical liberalism of 
Locke, Immanuel Kant, and J. S. Mill argued that the principles of polit- 
ical life should forego appeal to ideals of the good because thereby they 
express the individualist spirit which ought to shape the whole of our 
life. Classical liberals differed, of course, in important ways concerning 
the precise nature of the individualism they embraced. But they agreed 
on the fundamental idea that our allegiance to any substantial view of 
the good life—to any concrete way of life involving a specific structure 
of purposes, significances, and activities (such as a life shaped by certain 
cultural traditions, or devoted to a particular religion)—should always 
be a contingent one, revisable on reflection. Such forms of life can be 
truly valuable, they believed, only if we understand them as ones we 
choose, or would choose, from a position of critical reflection. Most im- 
portantly, they were at one in defending their political principles within 
the framework of this general philosophy of individualism. Our status 
as political subjects or citizens should be independent of whatever view 
of the human good we affirm, because in that way political principles re- 
spect—as Locke, Kant, and Mill would have said—the fallibilist, auw 
tonomous, or experimental attitude which we as persons should 
maintain at the deepest level of our self-understanding. 

Things, however, have not stayed so simple. The individualist view 
of life has itself become an object of reasonable disagreement. Espe- 
cially in the wake of the Romantic movement, there has arisen a new 
sense of the significance of belonging, an appreciation of tradition 
to which the premium that individualism places upon critical reflec- 
tion appears to embody a kind of moral blindness.’ In reality, a re- 
flective attitude toward received forms of life is one value among 
others. To give it supreme authority may therefore block recogni- 
tion of much else that is also of value. Thus it has been held that we 
can share in the good that some ways of life offer, only if we do not 
think of our allegiance to them as elective, as a matter of decision, 


* For a more detailed account of this Romantic theme, see my book, The Roman- 
ite (New York: Columbia, 1996), chapter 2, as well as The Morals of Modernity, 
pp. 127-34. 
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but regard it instead as constitutive of what we hold to be valuable, as 
rooted in a feeling of belonging. The importance of common cus- 
toms, ties of place and language, and religious faith can lie in their 
shaping the very understanding of value on the basis of which we 
make the choices we do. At the very least, is it not difficult to imag- 
ine our deepest moral commitments as ones that, from a critical dis- 
tance, we choose or would choose to affirm? To stand back in 
reflecion from them would seem to require stripping ourselves of 
the resources necessary to guide any sort of moral evaluation. 
Ought they not to be understood, therefore, as the inherited basis of 
deliberation, rather than as objects of choice themselves? 

The praise of belonging, though often presenting its critique of in- 
dividualism as an opposition to the Enlightenment, does not involve 
a reversion to premodern forms of thought. The forms of life with 
which it encourages identification were seldom tradition-minded 
themselves, seeking instead legitimation in some transcendent 
source, such as the voice of Reason or the will of God. Paradoxical 
as it may sound, traditionalism is a modern innovation, and for that 
reason we can expect it to prove an abiding feature of our culture. 
Today, despite the enormous influence they continue to wield, indi 
vidualist modes of thought have turned eminently controversial. No 
doubt they have always been contested, but now their philosophical 
difficulties are open to view. However we settle to our own satisfac- 
tion the merits of individualism and the sense of tradition as concep- 
tions of the good and the right, we cannot deny that on this 
question, reasonable people continue to disagree. 

In this regard, then, liberalism faces a new challenge. Should it 
keep its classical commitment to an individualist view of life? Or, 
taking instead its bearings from that modern experience of reason- 
able disagreement that was in the beginning its guide, should it seek 
a reformulation broad enough to appeal to those who are otherwise 
divided by this controversy? The second path is the one pursued by 
political liberalism, as Rawls and I understand it. The aim is to fix 
the principles of political association in terms independent, not just 
of religious convictions and substantive notions of the good life, but 
of comprehensive moral conceptions, too, to the extent that espous- 
ing individualist ideals or their rejection, they have become an ob- 
ject of reasonable disagreement in their own right. So understood, 
political liberalism does not represent a radical departure from the 
motivations of its classical forebears. The underlying continuity is 
the focus on the phenomenon of reasonable disagreement, and the 
differences stem from experience, as we have learned how pervasive 
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this phenomenon actually is. There may not therefore be much of a 
puzzle as to why political liberalism did not emerge earlier. It took 
the Romantic critique of individualism and its absorption into our 
culture to make plain how controversial reasonable people can find 


the overall moral vision to which classical liberalism has appealed.‘ 
Il. POLITICAL LEGITIMACY AND MORAL RESPECT 


Historical experience provides the impetus for liberalism to shed its 
individualist philosophy. Nonetheless, in assuming the form of what 
Rawls calls a freestanding political conception, liberal doctrine is 
heeding more than just the widening scope of reasonable disagree- 
ment. It is also drawing upon certain moral convictions, which imply 
that this is the proper route to take. After all, we may intelligibly ask 
why liberalism’s response to this controversy should be a reformula- 
tion of its principles. Why should liberalism become political, in the 
sense that Rawls and I intend? Why should liberal thinkers not in- 
stead dig in their heels and, observing correctly that no political con- 
ception can accommodate every point of view, maintain that 
liberalism stands or falls with a general commitment to individual 
ism? The answer must be that the essential convictions of liberal 
thought lie at a more fundamental moral level. So we need to bring 
out what these convictions are. 

Rawls remarks that liberalism seeks the form of a freestanding po- 
litical ideal, because it “applies the principle of toleration to philoso- 
phy itself.” By this he means that, because its goal is a conception of 
justice to which reasonable people can agree, this conception must 
“be, as far as possible, independent of the opposing and conflicting 
philosophical and religious doctrines that citizens affirm” (PL 9-10). 
In other words, the reason why liberalism must abandon its classical 
appeal to an overall individualism and become a strictly political 
doctrine lies in the idea that basic political principles should be ac- 
ceptable to those whom they are to bind. This idea Rawls has come 
to call the liberal principle of legitimacy. “Our exercise of political power 
is fully proper,” he explains, “only when it is exercised in accordance 
with a constitution the essentials of which all citizens as free and 
equal may reasonably be expected to endorse in the light of princi 
ples and ideals acceptable to their common human reason” (PL 


* In giving such t to this histomcal perspective, I may differ from Rawis, 
who finds puzzling the development of politcal liberalism. See his “Reply to 
Habermas,” JP 133, footnote 1. 


606 THE JOURNAL OF PHILOSOPHY 


137)” This principle, I agree, reflects the abiding moral heart of lib- 
eral thought’ But I do not think that Rawls has brought out as dis- 
tinctly as he might the nature of this moral foundation and the 
precise position it occupies in political liberalism. 

To see better what the liberal principle of legitimacy involves, let us 
ask why we believe, if we do, that the fundamental terms of political 
life should be the object of reasonable agreement. Answering this 
question will prove essential, in fact, to determining what reasonable 
agreement should mean in this context. Do we accept the liberal prin- 
ciple of legitimacy, because we think that in general people are bound 
only by moral rules they could not reasonably reject? Whether correct 
or not, this view of morality is far too controversial for the present pur- 
pose. As an explanation of the grounds of moral obligation,’ it risks 
circularity: Do not the reasons we have to accept the fundamental 
rules of morality involve the fact that it is simply wrong not to observe 
them? Moreover, in this form the view seems difficult to detach from 
the broader moral ideal of individual autonomy or self-determination 
toward which political liberalism must strive to be neutral. And if the 
validity of moral rules is instead supposed, not to be explained by rea- 
sonable agreement, but only to imply it,” the view still remains too 
controversial. For there is more than a little sense in holding that a 
person stands under the basic obligations of morality even if (perhaps 
through bad living) he has come to see the world in a way that can 
only be indifferent to them. In any case, general conceptions of moral 
obligation are not, at least directly, the source of our conviction that 
political principles must be able to meet with the reasonable agree- 
ment of the citizens they are to govern. 

That conviction reflects instead the distinctive feature of political 
principles which sets them off from the other moral rules to which we 
may believe people are subject. Moral principles fall into two groups. 
With some we believe people can be rightly forced to comply, but 
others we do not regard as valid objects of enforcement, whatever dis- 


! For other references to this principle, see PL 139-40, 148, 217, 225-26. 

* Following Rawis, I also agree that this principle is meant to govern chiefly the 
choice of basic, constitunonal principles. These pnociples will themselves allow 
for many decisions to be made according to less demanding mules, such as majority 
young, in which indeed appeals to controversial ideals of the good may sometimes 

priate 


be 
UAE S wien ted by T. M. Scanlon in What We Owe to 
Each Othar (Cambridge: Harvard, 1998). 

* This appears to be the basis on which Gerald Gaus argues that political princt 
ples require public justification—/fushiicatory Liberalism (New York: Oxford, 1996), 
pp. 121 
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approval or even outrage we may feel when they are violated. The 
first group alone has the status of political principles. For an associa- 
tion is political insofar as it relies upon the supposedly legitimate use 
of force to secure compliance with its rules. It is this coercive charac- 
ter of political principles which we have in mind, when we hold with 
the assurance that we do, and whatever our allegiance to the general 
ideal of judging people only by rules they could find acceptable, that 
such principles must be the object of reasonable agreement. Our be- 
lief is that only so can the use of force to implement these principles 
be justified. This, incidentally, is why political liberalism, though 
claiming to be a strictly political conception, does not imagine that 
the political domain forms a prepackaged sector of society, inherently 
distinct from the other areas of social life.” It is we who decide what 
will count as pokitcal depending on what elements of social behavior 
we think should be subject to coercion. 

With all these remarks Rawls himself would concur, for at one 
point he notes that the liberal idea of legitimacy rests on the fact 
that political power is coercive power (PL 139). But our analysis of 
this core liberal commitment must go deeper still. We need to make 
clear why it is that the validity of coercive principles should depend 
upon reasonable agreement. 

I believe that the source of this conviction is a principle of respect 
for persons. Let me explain.” Observe first that the use or threat of 
force cannot be deemed wrong in itself, for then political association 
would be impossible. What we must regard as improper is rather to 
seck compliance by force alone, without requiring reasonable agree- 
ment about the rules to be enforced. For consider the basic fact that 
persons are beings capable of thinking and acting on the basis of 
reasons. If we try to bring about conformity to a rule of conduct 
solely by the threat of force, we shall be treating persons merely as 
means, as objects of coercion, and not also as ends, engaging directly 
their distinctive capacity as persons. True, they cannot be moved by 
threats except by seeing that they have good reason to fear what we 
may do. But we shall be appealing to their ability to act on reasons 
simply in order to achieve the goals of compliance—the establish- 


u Habermas makes this crıticiem of Rawls’s political liberalism, wrongly it seems 
to me, in “Reconcilzation through the Public Use of Reason,” JP 129. 
™ Here, I rely on the argument laid out in The Morals of Modernity, pp. 136-41. 


608 THE JOURNAL OF PHILOSOPHY 


ment of public order, perhaps also the reformation of people’s char- 
acter. We shall not be engaging their distinctive capacity as persons 
in the same way we engage our own, making the acceptability of the 
principle depend on their reason just as we believe it draws upon 
our own. Thus, to respect another person as an end is to require 
that coercive or political principles be as justifiable to that person as 
they presumably are to us. This is certainly not the only sense we can 
give to the rich moral notion of respect” But it is the one which lib- 
erals must regard as relevant from a political point of view. (In this 
regard, the extent to which it avoids the peculiarities of the Kantian 
conception of respect is an advantage. 

IIL THE MORAL FOOTING OF A FREESTANDING POLITICS 
Liberalism, formulated as a strictly political doctrine, rests therefore 
on this moral foundation. It forms a freestanding conception in re- 
gard to comprehensive moral visions of the good life, but it cannot 
coherently claim to be freestanding with respect to morality alto- 
gether. In particular, we would be wrong to suppose that the moral 
principle of respect for persons has the political significance it does 
because reasonable people share a commitment to it. On the con- 
trary, the idea of respect is what directs us to seek the principles of 
our political life in the area of reasonable agreement. Respect for 
persons lies at the heart of political liberalism, not because looking 
for common ground we find it there, but because it is what impels us 
to look for common ground at all. 

These reflections lead, I believe, to a very important result. As citi 
zens of a political association organized in accord with the liberal 
principle of legitimacy, we cannot regard the norm of respect as hav- 
ing the same sort of validity as the constitutional principles by which 
we live. Those principles are legitimate in virtue of being the object 
of reasonable agreement, and thus their authority is strictly politcal, 
deriving from our collective will as citizens. Or at least so they are um 


*% Thus, as Willam Galston points out—Leberal Purposes (New York: Cambnidge, 
1991), p. 109-——we respect a person in a different sense when explaining to him 
fully our reasons for the principle by which we judge his conduct, whether or not 
he can appreciate those reasons My point is not to settle what respect “ 
means, for it really means a great many different My concern 1s with what it 

e ht to mean when figuring at the foundations of Hberal thought. Also, I agree 
Rawis—-A Theory of Justice (Cambridge: Harvard, 1971), pp. 585-86—that the 
relevant notion of respect is best circumscribed by referring to characteristic lib- 
eral prinaples (such as the liberal idea of legitimacy). But our aim ought then to 
be understood as determining the sense of respect for persons on which those 
principles reat. 
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derstood for political purposes (for when we refer to our global vi 
sions of the good, we may also attribute to them an independent sta- 
tus). But even in our capacity simply as citizens—relying solely on 
the commitments we share as members of a political community—we 
cannot take this view of the principle of respect. It must instead be 
understood as having more than just political authority. We must 
consider respect for persons as a norm binding on us independent of 
our will as citizens, enjoying a moral authority that we have not fash- 
ioned ourselves. For only so can we make sense of why we are moved 
to give our political life the consensual shape it has. 

It is not clear to me that Rawis understands his own thought in this 
way. To be sure, he thinks of his political liberalism as a “moral con- 
ception” (PL lln). In holding that liberal principles should be for- 
mulated as a strictly political conception, he does not mean to 
suggest any contrast between the political and the moral, as though 
liberal doctrine did not form a normative conception, as it manifestly 
does, consisting in certain ideals, standards, and values. But would he 
be willing to admit that, as citizens reasoning from the standpoint of 
this political conception, we must acknowledge a moral authority 
higher than the political principles we give ourselves? 

No unambiguous answer is available. About the liberal principle 
of legitimacy, Rawls says that it has the same basis as his two princi 
ples of justice: it would be chosen in what he calls the “original posi- 
tion,” being indeed bound up in those very two principles (PL 137n, 
225-26). In other words, Rawls seems to regard that principle of le- 
gitimacy as one whose validity, at least from the political point of 
view, depends on our collective will as citizens, whereas I of course 
have been arguing for just the opposite conclusion. Yet Rawls also 
notes that the original position, which is but a device for represent- 
ing the basis on which we understand ourselves as choosing or im- 
posing on ourselves political principles, incorporates certain values 
in the conditions under which such a choice is to be imagined taking 
place, values which thus are not themselves the object of choice (for 
example, PL 103). Although we do not, Rawls stipulates, imagine 
the parties choosing in the original position as endowed with any 
moral sensibility (they are merely rational, engaged in the efficient 
pursuit of their ends), the fact that we place certain conditions on 
their choice—they are to be supposed ignorant of their own talents 
or conception of the good life, for example—treflects a moral com- 
mitment of our own, namely, a commitment to what he calls reason- 
ableness, the readiness to seek fair principles of cooperation (PL 
305). Political principles apparently have then a moral basis that we 
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cannot conceive as rooted like them in our political will. Moreover, 
Rawls describes this moral commitment of reasonableness as em- 
bodying the resolve to propose political norms justifiable to all (PL 
49, 50); and this brings us right back to the liberal principle of legiti- 
macy. That principle cannot therefore have the same status as the 
two principles of justice; and, as it expresses in effect the idea of re- 
spect for persons, Rawls would apparently agree after all that this 
idea must have a moral authority for citizens that is independent of 
their political will. 

Yet Rawls never makes this point explicitly. On the contrary, his 
stated view is that because of its freestanding character, political lib- 
eralism is “doctrinally autonomous” (PL 98-99). At least in our role 
as citizens (if not in our comprehensive views of the world), we are 
to regard our political principles as deriving their validity from our 
political will. These principles cannot accordingly be understood, 
he holds, as “moral requirements externally imposed” (PL 98). This 
phrase can have various meanings, however. If it means that basic 
political principles are not to be imposed upon a citizenry by some 
external agency—by an enlightened monarch, for example—then I 
agree." But if instead it means that citizens should not regard their 
political principles as drawing upon moral requirements whose valid- 
ity is external to their collective will, then I believe Rawls goes wrong. 
Political liberalism makes sense only in the light of an acknowledge- 
ment of such a higher moral authority. 

In fact, the idea of respect plays this foundational role in several 
ways. As we have seen, it forms the basis for believing that political 
principles should be the object of reasonable agreement. But, in ad- 
dition, it serves to define the very nature of the agreement to be 
sought. To explain this point, I begin by observing that consensus in 
this case is clearly a normative notion if only because it refers to rea- 
sonable instead of simply actual agreement. Political life is to be 
based on principles that citizens, despite their various moral, reli- 
gious, and metaphysical beliefs, can see reason to accept, exercising 
(as I have said) the basic capacities of reason applicable to any do- 
main of inquiry.” But reasonableness (so understood) is not the 


4 In this connection, see Rawk, “Reply to Habermas,” JP 162-63. 

“ The reasonable so understood does not entail uncritical deference to com- 
mon sense and to the formal fallacies of reasoning which ıt may harbor or to the 
refusal to admit the existence of reasonable disagreement about the good which it 
may show. Contrary to Gaus—jfust/icatory Liberalism, pp. 3-5, 181-36—political lib- 
eraliam as I conceive it (or, I believe, Rawls as well) is not popukist in this sense. 
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sole normative criterion defining the sort of consensus in question. 
Because the idea of respect directs us to look for common ground in 
the first place, it should be understood as a further condition which 
acceptable principles must satisfy. The terms of political association 
are to be judged by reference to what citizens would accept, were 
they reasonable and committed to the principle of equal respect for 
persons. The notion of agreement to which political liberalism ap- 
peals is therefore an idealization. It comes into play only within the 
bounds set by these two norms, the one epistemic, the other moral. 
This conclusion is, again, one which Rawls should be willing to ad- 
mit. His liberal principle of legitimacy ties consensus to precisely 
these two assumptions when referring to what citizens in the light of 
“their common human reason” may be “reasonably expected to em 
dorse.” (Recall that his notion of the reasonable embodies the prin- 
ciple of respect.) Yet he is not as clear as he should be about the 


moral foundations of this idea of consensus. 
IV WHAT HABERMAS AND RAWLS SHARE 


Habermas, too, aims to develop an autonomous conception of the 
guiding principles of modern democracy. This ambition inspires the 
defense of radical democracy in his book, Faktizitāt und Geltung: 
Bettrdge zur Diskurstheorte des Rechts und des demokratischen 
Rechtisstaates.* The democratic ideal as Habermas understands it has 
nothing to do with the communitarian dream that our political life 
might once again draw its bearings from a common vision of the hu- 
man good. In this respect, it shares the outlook of political liberal 
ism. But Habermas also opposes it to what he calls the liberal idea 
that individual rights set limits to the exercise of democratic self- 
rule. Liberal thinkers err, he believes, in thus holding political asso- 
ciation accountable to moral norms supposedly given in advance. 
For they thereby fail to heed the intellectual conditions of modern 
times. In an age where religious and metaphysical worldviews have 
lost their authority, we can have reason to consider ourselves subject 
to political principles only if we are able at the same time to see our- 
selves as the authors of these principles: 


* Subrkamp: Frankfurt, 1992, p. 13; English translation by William Rehg: Be 
tween Facts and Norms: Contributions to a Discourse Theory of Law and Democracy (Cam- 
bridge: MIT, 1996), p. xhi. Subsequent references to this work will be given in the 
text, with ‘FG’ preceding the page in the German original and ‘BFN’ the corre- 
sponding page m the translation. In quotations, I have changed the English trans 
lation as I saw fit 
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Without religious or metaphysical support, the coercive law designed 
for legal behavior can preserve its socially integrative force only insofar 
as the addressees (Adressaten) of legal norms may at the same time under- 
stand themselves, in their collectivity, as the rational authors (Urhaber) of 
those norms (BFN 33; FG 51-62). 


In Habermas's view, this kind of collective autonomy (to be distin- 
guished, obviously, from the ideal of individual autonomy, though 
the two are structurally similar) constitutes the source of the basic 
norms shaping political association. We miss the true character of 
the principles of modern democracy, he claims, if we adhere to the 
“classical” (that is, premodern) notion of a normative hierarchy 
whose pinnacle is occupied by independent moral principles to 
which political association must defer (FG 137; BFN 106, 449). Be- 
cause liberal thought, subordinating democratic selfrule to individ- 
ual rights, continues to rely on this hierarchical model, Habermas 
believes that we must move beyond a liberal conception of political 
life. We must be resolutely modern, recognizing that political princi- 
ples have their basis only in the autonomous will of citizens who are 
the full authors of the rules that bind them. 

Habermas often describes his position as one in which fundamen- 
tal individual rights, no longer serving to check democratic self-rule, 
turn out to be co-original (gletchurspriingtich) with it In other words, 
the modern and ancient ideals of liberty, as those two principles 
have come to be called, are not so open to conflict as commonly sup- 
posed.” Sometimes Habermas’s idea seems to be that the two mutt 
ally support one another. Just as self-government can serve to 
protect individual rights, so these rights themselves provide the nec- 
essary means for the exercise of popular sovereignty—the right to 
free expression, for example, making available the wide range of in- 
formation and points of view that permit wise political decisions (FG 
155, 161; BFN 122, 127-28). 

But, in reaching for the idea of co-originality, Habermas has 
clearly a deeper point in mind as well. It is that the two principles 
have a common origin, and that source is, he believes, the autonomy 
by which a community gives shape to its political life. The result is 
that, despite his talk of mutual implication, Habermas unmistakably 
makes democratic self-rule a principle prior in status to that of indi- 
vidual rights. The claim that rights and self-rule are co-original is 


T See Habermas, Dw Exnbenshung des Anderen (Frankfurt Suhrkamp, 1996), p. 298; 
English translation by Ciaran Cronin and Pablo De Greit The Inctuson of the Other: 
Studses in Polttical Theory (Cambridge: MIT, 1991). 
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misleading, in light of what he takes to be the only plausible origin 
of political norms in the modern age. In reality, popular sovereignty 
functions for Habermas as the ultimate basis on which our political 
life should be organized, and so as the true source of individual 
rights, their ultimate justification lying in the way they embody the 
principle of self-rule (cf. FG 134-35; BFN 104). The reason is plain: 
only so can we escape letting the principles of political association 
draw upon moral values having an independent validity. (I come 
back to the details of Habermas's argument in section VI.) 

A striking similarity thus emerges between Habermas’s conception 
of autonomy and Rawis’s demand that political principles be free- 
standing. Indeed, as I have remarked, Rawls himself calls a free- 
standing political conception “doctrinally autonomous” and explains 
this term in much the same way. A political conception is au- 
tonomous, he writes, if “the political values of justice and public rea- 
son (expressed by their principles) are not simply represented as 
moral requirements externally imposed” (PL 98). In their compan- 
ion essays in this JOURNAL, both philosophers emphasize this point of 
convergence (JP 127, 150). 

But it is also obvious that Habermas’s idea of political autonomy is 
the very notion that I have criticized in my discussion of Rawls. Polit- 
ical association, so we have seen him claim, must not be understood 
as drawing its principles from a moral source superior to its collec- 
tive will. In his hands, this conception fares no better than it does in 
Rawis’s. Habermas, too, misses the moral basis that supports the de- 
mocratic ideal of self-rule. To make clear where precisely his mis- 
take lies, I shall first look at some disagreements between the two 
philosophers. In their debate in this JOURNAL, they both noted im- 
portant differences in their handling of this common idea. Rawis re- 
jected the manner in which Habermas arrives at his conception of 
political autonomy. Habermas, in turn, charged that Rawls strays 
from a consistent development of such a conception. Understand- 
ing these points of divergence will put us in a better position to see 
why, contrary to them both, modern democracy rests upon indepen- 


dent moral foundations. 
V. METAPHYSICS AND POLITICS 


As Rawis observes (JP 135-38), political autonomy in Habermas’s 
eyes stems from a broader philosophical position described as past- 
metaphysical In our time, Habermas declares, metaphysical theories 
asserting the existence of entities that are neither physical nor psy- 
chological in nature (“ideal entities” he calls them) as well as reli- 
gious conceptions that see the world as the work of God and the 
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vehicle of providence have lost their plausibility. His point is not 
simply that they no longer enjoy the society-wide authority they once 
possessed, having become the object of apparently irresoluble dis- 
agreement. For Habermas, the age of metaphysical and religious 
worldviews is over in the deeper sense that, properly speaking, ra- 
tional belief in their truth is no longer possible. Reason itself, he be- 
lieves, now presents itself as essentially finite, fallible, procedural, 
and oriented toward intersubjective agreement. It consists in think- 
ing or acting on the basis of reasons we take to be valid, not because 
we imagine them guaranteed by an objective order of the world, but 
because we suppose that they would command the assent of others 
under appropriately ideal conditions. We are to locate the norms of 
reason, no longer in an ideal realm independent of us, but instead 
in the idealizations we necessarily make in speaking with one an- 
other.” A battery of philosophical arguments, assembled under the 
title of discourse theory, aims to bring out the necessity of this post- 
metaphysical shift in our conception of reason and to explain the 
idealized form of discussion (Diskurs) to which reason thus refers. 

Habermas claims that it is in virtue of this general standpoint that 
the idea of political autonomy recommends itself. We are to estab- 
lish the terms of political association by applying this general ac- 
count of reason to the fundamental task of politics, which is to 
determine the scope of the rule of law. Thus, once reason can in- 
voke no higher authority than agreement under ideal conditions, 
the principles of conduct that are to bind us must be such that we 
can see ourselves to be their source. This is the meaning of the pas- 
sage I began by quoting in section Iv. 

Rawls takes his distance from this approach, and rightly so. 
Habermas’s argument has the significant disadvantage that it relies 
on a comprehensive vision of our place in the world and the nature 
of reason’s authority. His postmetaphysical point of view shares, as 
Rawls insightfully notes, the ambition of G. W. F. Hegel’s logic, 
providing “a philosophical analysis of the presuppositions of ra- 
tional discourse...which includes within itself all the allegedly sub- 
stantial elements of religious and metaphysical doctrines” (JP 137). 
Such doctrines will scarcely recognize, much less accept, the form 
into which they are thus translated, for it is one in which, as Haber- 
mas himself remarks, “worldviews are measured more by the au- 
thenticity of the life styles they shape than by the truth of the 


" There is a succinct and lucid account of this hne of thought in FG 24-37; 
BFN 9-21. 
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statements they admit” (JP 126). This idea of postmetaphysical 
reason can be expected therefore to provoke controversies of pre- 
cisely the sort that we must look beyond, so Rawis believes (and I 
agree), if we are to discover acceptable terms of political associa- 
tion. After all, the notion of ideal discussion to which Habermas 
appeals is not self-explanatory, and the transcendent interpretation 
he favors, defining ideal conditions as he does by reference, not to 
the (“local”) standards of belief we currently espouse, but to the 
best standards we ever could have (FG 30-31, 36, 202, 566; BFN 15, 
20, 163-64), is neither the only possibility nor one likely to strike 
many people as anything short of metaphysical. 

In truth, the vitality or obsolescence of metaphysical and religious 
worldviews is an issue on which reasonable people tend naturally to 
disagree. It is fair to observe that today none of these conceptions 
can prove authoritative for society as a whole. Yet to go further and 
say, as Habermas does, that they can no longer figure as objects of 
rational belief, but only as life styles, means adopting a point of view 
that itself is party to these disputes. Habermas has misidentified the 
feature of modern experience that is crucially relevant to the basis of 
political association. The decisive element is not the waning of 
metaphysical and religious worldviews (though that has occurred). 
It is instead the recognition that such worldviews, as well as the re- 
current postmetaphysical efforts to do without them, are an endur- 
ing object of reasonable disagreement. 

As a matter of fact, Habermas’s idea of postmetaphysical reason 
seems to me to go wrong in two important respects.” First, I believe 
that our conception of the world must have room for ideal entities, 
for only if reasons exist (a reason being itself neither physical nor 
psychological in character) can there be such a thing as normative 
knowledge—knowledge of how we ought to act, but also more fun- 
damentally knowledge of how we ought to think. Second, I do not 
think that the notion of ideal discussion forms a substantial part of 
the theory of justified belief. We may say, if we wish, that justified be- 
liefs are ones to which we assume that all would agree in an “ideal” 
discussion. But we can have a grasp of such ideal conditions only if 
we suppose them to embody our best standards of warranted belief, 
and so we might as well say, more directly, that the beliefs are just 


* For more on these two points, see my Morals of Modernity, chapter 5 and pp. 
206-10 
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fied if they satisfy these standards.” In Habermas’s account of justi- 
fied belief, it is true, ideal discussion plays a role that resists elimina- 
tion of this sort, since it is assumed to transcend our current 
standards. But in consequence it loses any discernible content. 

Yet I should emphasize that the main point in the present context 
is not whether Habermas’s conception of reason is correct, but 
whether it can serve as an appropriate basis for establishing the terms 
of our political life. These two questions must be distinguished, once 
we are convinced that political association turns crucially on finding 
principles reasonable people can accept despite their disagreements 
about fundamental matters of life and value. In seeking a solution to 
this political problem, we cannot call upon all that we ourselves may 
reasonably believe to be true about such matters. Habermas gives lit- 
tle attention to the difference between these two perspectives. That 
no doubt is why he runs them together. The result is an account of 
the foundations of political life which, whatever its interest, and what- 
ever the ultimate validity of the conception of reason on which it re- 
lies, is destined to be an object of reasonable disagreement. In this 


regard, Habermas’s political theory falls short of what we need. 
VL HABERMAS'S IDEAL OF POLITICAL AUTONOMY 


Habermas himself locates the crucial difference in their accounts of 
political autonomy elsewhere. In his view, Rawls fails to adhere con- 
sistently to this principle in working out his theory of justice, revert- 
ing to the liberal assumption that there exist individual rights prior 
to democratic self-rule and defining its scope. This backsliding, he 
charges in the exchange in this JOURNAL, arises from the “two-stage 
character of his theory” (JP 128). While Rawis’s idea of the original 
position, in which rational parties choose principles of justice, cap- 
tures something like collective autonomy, it yields a conception of 
the just society in which “liberal rights...constrain democratic self- 
legislation” (JP 128-29). Citizens cannot then “reignite the radical 
democratic embers of the original position in the civic life of their 
society” (JP 128). 

Behind this objection there may well lie, as Rawls suspects (JP 
160), some sympathy with the quixotic idea (once famously voiced by 
Thomas Jefferson) that every generation should give itself its own 


* There is a similar criticism m Albrecht Wellmer, Sppe a Shenoy 
Suhrkamp, 1986), pp. 70-80; English translation by David Midgley in in The 
Persistence of Modernity: Essays on Assthaics, Ethics, and Postmodernism (Cambridge: 
MIT, 1991). 
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constitution, unfettered by the past decisions of those now dead. But 
I believe that the gist of Habermas’s complaint has little to do with a 
call for permanent revolution. It concerns instead the understanding 
we should have of the basis of our political association. His view is 
that citizens must be able to see all their political principles, even 
those establishing individual rights, as rooted in their autonomous 
political will. Such a self-understanding is blocked by a conception 
that gives rights, as he believes Rawis’s theory does, a status prior to 
the democratic process. This is the point at which liberalism and 
Habermas’s own theory of radical democracy supposedly part ways. 

What is the force of Habermas’s criticism? I am inclined to think that 
he terms inconsistency what I have called ambiguity. Habermas objects 
that Rawis fails to carry through consistently an autonomous or free- 
standing conception of political life. But perhaps another way to put 
the point is to say, as I have argued, that Rawis’s idea of a freestanding 
political conception is ambiguous. Sometimes he implies and some- 
times denies that his strictly political doctrine rests on the basis of a 
moral principle of equal respect. To the extent that he admits this 
truth, his understanding of a freestanding political conception does not 
really correspond to what Habermas calls autonomy, which excludes 
this sort of normative hierarchy. In any case, it is clear that Habermas 
and I take opposite views of these tensions in Rawls’s thought 

But there is a further complication. Habermas’s own political the- 
ory does not hold consistently to the conditions he assigns to the no- 
tion of autonomy. In fact, his conception of political autonomy 
makes sense only on the assumption that self-rule rests on an under- 
lying, moral norm of respect. Nor is that astonishing. For only on 
such a moral basis can popular sovereignty take the form that we 
prize as modern democracy. Moreover, this moral foundation in- 
volves, contrary to Habermas’s stated intention, an idea of individual 
rights that precedes and defines the exercise of self-rule. In the end, 
therefore, the sharp opposition he constructs between liberalism 
and his own conception of radical democracy, and thus the criticism 
he makes of Rawls, lose their raison d'être. Habermas, I believe, mis- 
understands himself. It is to developing these points that I now turn. 

To begin, let us examine more closely the way Habermas under- 
stands political autonomy. In explaining how the autonomy charac- 
teristic of modern democracy must proceed without relying upon 
any antecedent moral norms, Habermas appeals to what he calls dis- 


course principle D: 
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Just those action norms are valid to which all possibly affected persons 
could agree as participants in rational discourses (Diskursen) (BFN 107; 
FG 138). 


Modern democracy has its basis in the collective autonomy of its citi- 
zens, Habermas declares, because in determining the principles by 
which they will govern themselves, they defer to no pregiven norms 
other than just this principle D. “Nothing is given prior to 
(vorgegeben) the citizens’ practice of self-determination (Selbstbestum- 
mungspraxis) other than the discourse principle” (BFN 127-28; FG 
161). In contrast to liberal theories, which appeal to antecedent 
moral norms (in the form of individual rights) to limit the scope of 
self-rule, radical democracy, as Habermas conceives it, looks to no 
higher authority than its own self-legislative activity as governed by 
principle D. Rights guaranteeing individual freedoms (such as 
speech and assembly) have their basis, he claims, in the application 
of the principle to the very form of a legal system, whereas the rights 
of political participation serve to make possible the institutionaliza- 
tion of this sort of self-rule (FG 154-62; BFN 121-28).” 

My concern lies, not with the details of this “logical genesis of 
rights,” but with the status of D itself. It is crucial for Habermas’s 
conception of radical democracy, as he makes clear (FG 138; BFN 
107), that principle D possess no moral content of its own.” Morality 
and democracy, he holds, involve distinct and equally basic applica- 
tions of principle D, the difference consisting in whether the princi 
ple is to shape the interactions between human beings as such or is 
to take a legal form binding for a specific political community. Cer- 
tainly, there is a significant overlap between the norms thus gener- 
ated in these two domains. But in Habermas’s view, this does not 
change the fact that the terms of political life have their justification 
solely in the autonomous will of the citizens, as expressed according 
to principle D, and not in any moral principles that citizens must rec- 
ognize as setting limits to their will. 

This is the point, however, where Habermas displays an incom- 
plete understanding of his very devotion to democratic self-rule. 
Observe that popular sovereignty itself may take different forms, de- 
pending on how the popular will is supposedly determined and 
given authoritative expression. Even modern dictatorships, since un- 


€ See also Habermas, Die Einbexsshung des Anderen, pp. 299-301. 
™ As Habermas observes (FG 140; BFN 108), this thems of Faktestat und Geltung 
represents a departure from his earlier writings. J think it 1s a wrong move. 
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like premodern monarchies they disclaim dynastic or divine legit 
macy, are a phenomenon of the modern democratic age: they too 
draw their authority from the will of the people, as they choose to 
understand it.” When conceived as heeding a principle such as D, 
popular sovereignty takes on a form distinctive precisely by virtue of 
its moral character, for all that Habermas may protest to the com 
trary. D, he admits, possesses “normative” content, even if it is nota 
moral principle (FG 138; BFN 107). That is surely a feeble distinc- 
tion. D has moral content, and we can bring it to light by asking the 
simple question: Why should we believe, as this principle requires, 
that norms of action must be rationally acceptable to all whom they 


are to bind? Whence arises the authority of principle D itself? 
VI. DEMOCRACY AND LIBERALISM 


Habermas himself never provides an adequate answer to this ques- 
tion. He suggests that such a principle is inevitable, once we realize 
that in our age metaphysical and religious conceptions, characteristi- 
cally inclined to invoke principles of conduct of which we are not 
the source, have lost their rational warrant. But this response will 
not work on a variety of counts. As I have already indicated, it is rash 
to imagine that reason today stands uncontroversially opposed to be- 
lief in the truth of metaphysical or religious worldviews. A more ac- 
curate diagnosis of our situation is that such visions can no longer 
enjoy reasonable agreement within society as a whole. We might 
well take this fact to entail that metaphysical and religious concep- 
tions should not figure among the principles that bind us together 
politically. But note that such a conclusion is valid, only if we already 
accept some principle such as D; it cannot serve to justify that princi 
ple. Observe further that, if we refuse a political role to metaphysi+ 
cal and religious woridviews on some other grounds— because, for 
example, we believe, as Habermas does, that they can no longer be 
rationally held to be true, we shall not thereby have reason to adopt 
principle D. Why should we not suppose that without appeal to such 
ultimate truths the problem of political association is insoluble? Or 
why should we not organize our political life around some principle 
other than D, such as the maximization of the general welfare? 
Sometimes Habermas claims that the principle enjoys a privileged 
status because it inheres in the very idea of discussing with others the 


> This pomt was rightly made, however dimngenuously for his own purposes, by 
Carl Schmitt, The Grins of Parhamentary Democracy (Cambridge: MIT, 1985), p. 28 
(German original: Dw geistespeschichthcha Lage des heutigen Parlamentansmus (Berlin: 
Duncker and Humblot, 1923 and 1926)). 
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norms to which our actions should conform, that idea being the only 
common resource we can count on in an age of controversy.* This 
suggestion continues Habermas’s longstanding interest in basing 
morality on “universal pragmatics.” I doubt, however, that so much 
can be gotten for so little. D is too substantial a principle, and pre- 
cisely because of its significant moral content, to be implicit in the 
mere idea of practical discussion. 

A good way to see this is to reflect that, taken as a general princi 
ple applicable to all domains of life, D is not even obviously correct. 
Of course, the very meaning of D is obscure so long as the notion of 
“rational discourse” is left undefined, and Habermas’s own wish to 
understand that notion in abstraction from any specific standards of 
belief does not help. But however we explicate the idea of rational 
agreement, principle D in its general form faces the sort of objec- 
tions I mentioned in section u. Thus, many people believe indeed 
that we should not judge others by rules that on due consideration 
they would not themselves accept. But many, too, believe the oppo- 
site. They are convinced that certain fundamental norms of con- 
duct, possessing an objective validity, can serve as a basis for judging 
others, whether or not they meet with agreement. I myself subscribe 
to such a view. But what should be clear is that the general validity 
of a principle such as D is a matter on which reasonable people dis- 
agree. Equally plain is the fact that this disagreement turns on dif- 
ferent moral convictions about the conditions under which we may 
judge others morally and no doubt, too, on different appreciations 
of the moral ideal of individual autonomy. This suffices to show that 
contrary to Habermas, D, taken as a general principle, has a moral 
content and a controversial one at that. 

But let us go further and look at D in the political role it is sup- 
posed to play. Recall a point made earlier: norms of conduct are of 
two kinds. There are, on the one hand, norms that we invoke solely 
to judge the conduct of others, and, on the other, norms that we be- 
lieve should be backed up by the threat and use of force to ensure 
compliance. So let us consider principle D as applied simply to 
norms of the second kind, that is, to poktical principles. Now its com- 
plexion is rather different. It need not be rejected by someone who 
believes that we may rightly judge others in the light of norms that 
they themselves would not accept. For it requires only that the prin- 


a , for example, most recently Habermas, Dis Eznbenshung des Anderen, pp. 
58-59. 
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ciples of our political life, being coercive, be rationally transparent 
to those whom they are to bind. In this limited, political form, D 
thus amounts to the conviction that Rawls calls the Aberal principie of 
lentimacy. As a result, however, it cannot be right for Habermas to 
regard D as independent of any antecedent moral commitments. 
For, as I have argued, the liberal principle of legitimacy embodies a 
moral principle of respect for persons. This moral foundation is 
what gives D the political authority it enjoys. Thus, Habermas’s con- 
ception of democracy fails to be autonomous in the absolute way he 
desires. If we believe our political life should be organized by some 
principle such as D, that is only because we embrace the moral prin- 
ciple of equal respect for persons. 

The sort of normative hierarchy that Habermas dismisses as pre- 
modern shapes in fact his own thinking. Self-government, so far as 
it admits only those political principles which can be the object of 
reasonable agreement, rests on an independent moral basis. We 
cannot regard respect for one another as just one more political 
principle among others whose validity derives from our political will. 
On the contrary, this principle justifies the ideal of self-government 
and defines for us what it means. Reasonable agreement must in 
this context be understood (as I argued in section I) as circum- 
scribed by the moral principle of respect. Comprehensive moral 
views of the ends of life being eminently controversial, our political 
life cannot seek its basis in them. About this point Habermas is 
right. But we cannot suppose, as he does, that our political life 
should be similarly freestanding or autonomous with respect to 
morality altogether. Citizens can understand themselves as the cok 
lective author of their political principles, only if they see themselves 
as already bound by the claims of respect for persons. 

This moral principle refers, moreover, to an individual right, even 
if one more fundamental than the political rights that usually figure 
among explicit constitutional guarantees. Every person has the 
right, it claims, to be bound only by political principles whose justift 
cation he can rationally accept. As a result, Habermas’s position 
comes down to one in which, contrary to his own account, an ind? 
vidual right sets limits to democratic self-rule, determining as it does 
the sort of expression of the popular will that shall count as democra- 
tic. The familiar constitutional rights of free-expression, property, 
and political participation, though no doubt serving to promote the 
goal of democratic self-rule, also have an independent rationale. 
They draw upon that most fundamental of individual rights, which is 


the right to equal respect. 
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In the Postscript to the English translation of Faktizitat und Geltung, 
Habermas has replied to an earlier version of this criticism.” To un- 
derstand the normative basis of modern democracy, he argues, we 
must distinguish the horizontal sociation of citizens, in which they con- 
stitute their collective will and in so doing mutually accord rights to 
one another, from the subsequent step in which they set up rights to 
protect individuals from the political power so constituted. There- 
fore, individual rights against the state “are not ongtnary but rather 
emerge from a transformation of individual liberties that were at first 
reciprocally granted” (BFN 457)—that is, established in the very for- 
mation of their collective will. 

I am not sure how neatly these two steps can be distinguished 
from one another, but I shall let that go. My concern lies with the 
first step. Observe that Habermas himself describes the horizontal 
sociation, which he holds to be foundational, as a process where cith 
zens “recognizing one another as equals, mutually accord rights to 
one another” (BFN 457). Whence, I ask, comes their recognition of 
one another as equals? As Habermas’s very words imply, this is nota 
commitment they acquire in virtue of such association, but rather 
one which defines the sort of association they understand themselves 
to be forming. This commitment amounts in fact to precisely what I 
have been calling the principle of respect. And that principle em- 
bodies an individual right—a right requiring that the terms of polit- 
cal association to be developed be as transparent to one’s own 
reason as to that of others. Such a right does not arise through be- 
ing mutually granted by citizens as they form their collective will. It 
has an independent authority which individuals must acknowledge if 
they are to form a democratic will in the way Habermas envisions. 

I see no reason, therefore, to change my conclusion that Haber- 
mas’s notion of radical democracy is not really so radical that it differs 
materially from the idea of liberal democracy to which he imagines 
himself opposed. Failing to note the moral basis of his own commit- 
ment to democratic self-rule, he slips past the fact that he too assumes, 
if only implicitly, the antecedent authority of individual nghts. 

VIIL FREEDOM AND MORALITY 
Modern democracy is government by self-rule, animated by the ideal 
that the basic rules of political life should draw their legitimacy from 
being the object of reasonable agreement among those whom they 


= I first presented this criticism in a review of Fakimutat und Geltung, “Die 
Wurzeln radikaler Demokratie,” Deutsche Zeitschnft für Philosophie, XLI (1993): 
$21-27. I developed it further in Morals of Modermity, chapter 10. 
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are to bind. But we misunderstand the nature of the democratic 
ideal, I have insisted, if we suppose that for it, the collective will of 
the citizens constitutes the ultimate source of authority. Looking 
more closely at the notion of the collective will employed, we see 
that it is conceived in moral terms. Popular sovereignty can be un- 
derstood as manifesting itself through reasonable agreement, only if 
it is defined as heeding the obligation of respect for persons. 
Democracy is thus a moral conception, and not just in the trivial 
sense that the principles and values by which a democratic people 
organize their political life are recognizably moral in character. 
More profoundly, democracy involves commitment to a moral prin- 
ciple that citizens must see as binding on them independent of their 
democratic will. Respect for persons is what gives their democratic 
will the normative shape it has. 

I have claimed that Habermas misses this moral basis of modern 
democracy and that Rawls, too, though recognizing it in part, fails to 
bring it out as distinctly as he should. My point, as I hope is plain, is 
not to reject the project of working out a conception of liberal 
democracy that is politically freestanding. That project I share. But 
I believe its moral presuppositions should be made explicit. Why, we 
may ask, should there be such reluctance to admit these assump- 
tions? One reason is not far to seek. Liberal thinkers who continue 
to sympathize with earlier individualist versions of liberal democracy 
do not hesitate to expound the moral foundations of this ideal as 
they see them.” But both Habermas and Rawls, in their different 
ways, aim to work out a political conception that does not depend 
upon comprehensive moral views, especially individualist ones, 
about the ends of life. In the effort to devise a political conception 
that is thus freestanding, it is only too easy to slide over the moral 
commitments propelling this very enterprise. 

I do think it important that the principle of respect does not ex- 
press or entail a comprehensive moral philosophy. It has its place in 
a great many otherwise disparate ideas of the human good. Particu- 
larly significant is the fact that it can figure in those conceptions of 
life which refuse to accord supreme value to critical reflection and 
call instead upon forms of moral allegiance that are rooted in a sen- 
timent of belonging. We may feel that our deepest commitments 
are constitutive rather than elective, fashioning our very sense of the 


* See, for example, Jeremy Waldron, Liberal Rights (New York: Cambridge, 
1993), pp 56-57, 163-68. 
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persons we are and the choices we can understand ourselves as mak- 
ing, and at the same time believe that political principles, relying as 
they do upon coercion, must be reasonably acceptable to all whom 
they are to bind, even to those who differ with us on the relative im- 
portance of critical reflection. There exist, of course, conceptions of 
the human good which reject a principle of respect for persons. No 
doubt, many of them also desire to limit critical reflection and 
preach instead obedience to traditional, often religious authorities. 
But these two views do not go necessarily together. In fact, individu- 
alist views of life, too, when they exalt the will to power of creative in- 
dividuals and heap contempt upon the bulk of mankind, can prove 
illiberal. A reverence for tradition is not inherently hostile to liberal 
ideals and more than the cultivation of individuality is intrinsically 
friendly to them. 

But it also bears noting that reasonable people, exercising the ba- 
sic capacities of reason and conversing with others in good faith, are 
not obliged on this basis alone to believe that the use of force should 
remain within the bounds of consensus. Being reasonable (in my 
sense of the term) does not entail the principle of respect for per- 
sons. If the moral basis of liberal democracy stands free of any com- 
prehensive conception of the good, it does not on that account 
become inherently universal. Its appeal can extend only so far as 
people happen to be committed to the principle of respect.” We 
ourselves may find this commitment so obvious that, like Habermas 
and Rawis, we fail to recognize or mention it. We may simply look 
through it, as we reflect upon the constitutional rules a democratic 
people should give themselves to govern their political life. But in 
reality, this commitment forms the specific moral “we” we are, even 
as we remain divided by important differences about other funda- 
mental matters. Self-understanding, here as elsewhere, requires that 
we turn our mind to what lies so close to us as to go unseen. Then 
we can comprehend why this moral outlook, transparent though it 
may be to us, is not universal and has met indeed with sincere rejec- 
tion from others. 

Yet I wonder whether there is not another reason why Habermas 
and Rawis do not make explicit the moral basis of their concep- 
tions of liberal democracy. Perhaps in their case, as certainly in 
the case of others, another contemporary ideal has come to stand 


7 I agree therefore with Stanley Fish—"“Mission Impossible: Settling the Just 
Bounds Between Church and State,” Cohsninca Law Remes, xcvi, 8 (December 1997): 
2255-833—that liberalism, like any political doctrine, must involve exclusion. All the 
sare, a distinctive virtue of pohtical liberalism is the breadth of its inclusiveness. 
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in the way of seeing our deepest commitments for what they are. 
In our time, freedom of self-determination, as both an individual 
and a collective value, enjoys a tremendous prestige. It commands 
so ready an allegiance that all other values can seem of subordinate 
importance. Individuals can be bound, it is said, only by the rules 
they give themselves. So, too, we continually hear that, in their col- 
lective capacity as members of a political community, citizens are to 
determine themselves the principles by which they will live.” I 
doubt that the moral life in general can make much sense if its 
roots are sought in freedom as a supreme value.” I am certain, as I 
have argued in this essay, that our commitment to democracy or 
political self-determination cannot be understood except by appeal 
to a higher moral authority, which is the obligation to respect one 
another as persons. 

CHARLES LARMORE 
University of Chicago 


= Cf. Habermas, Dw Emberiekung des Anderen, p. 301: "Menschenrechte mdgen 
moralisch noch so gut begrimdet werden kdnnen, sie dürfen aber einem Souverin 
(here, the democratic will, C_L.] nicht paternahstiech übergestülpt werden... Dieser 
Idee widersprache es, wenn der demokratische Verfassungsgesetrgeber die Menm 
schenrechte als so etwas wie moralische Tatsachen schon vorfinden wide, um ale 
nur noch zu positivieren.” 

* For reflections along these lines, see The Morals of Modernity, pp 87-88, and 
“L'autonomie de la morale,” Phtlasophiques (Québec), xxiv, 2 (automne 1997): 
313-28. 
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Mind, Language, and Society: Philosophy tn the Real World. JOHN SEARLE. 
New York: Basic Books, 1998. x + 173 p. Cloth $21.00. 


John Searle describes this book as a summary explanation of his 
views on mind, language, and society, and on “how they fit into our 
overall contemporary conception of the universe.” As such, it is a 
great success. Searle’s prose is lucid, but vigorous and muscular. He 
has things to say, and he says them well. He also lays out a great deal 
of territory, so the book is eminently usable as an introduction not 
only to Searle’s own ideas, but to many important themes in contem- 
porary philosophy of mind and social philosophy. Anyone who has 
tried to write introductory books on these subjects will know how dif- 
ficult it is to achieve that 

Searle starts with a chapter laying out the basic tenets of his meta- 
physics. He defends realism against idealism, skepticism, and rela- 
tivism. His realism is composed of a number of “default” positions 
or things that are taken for granted, and forms what he calls the 
background of our thoughts and language. The background includes 
the presuppositions that there is a real world, that we can perceive it 
and know about it, that our words have clear meanings, that state- 
ments are true when they correspond with the way things are, and 
that causation is a real relation between events. Searle fends off ide- 
alist, relativist, or skeptical challenges to the background in fairly fa 
miliar ways. Indeed, he thinks that the arguments presented by 
those who advance such challenges are so feeble that they call for a 
different diagnosis, which he finds in the “will to power.” If some 
kind of relativism or constructionism is true, then it is we who are in 
charge, rather than the world which is in charge of us. 

Chapter 2 defends the existence of mind as a biological phenome- 
non, while chapter 3 continues with the structure of consciousness, in- 
chiding interesting material on the unity of consciousness. Subsequent 
chapters cover intentionality, the structure of the social universe, and 
the nature of language. On all these topics, Searle has interesting 
things to say, often convincingly, sometimes controversially. 

The avowed centerpiece of the book is Searle’s rejection of both 
materialism and dualism. It is this which underpins his claim to be 
giving a unified metaphysics. For once mind is firmly established in 
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nature, the rest is, as it were, a question of mopping up. The inten- 
tional powers of the mind, and the construction of a complex social 
world that includes language, are placed on the secure foundation 
provided by a nonmysterious theory of consciousness. So everything 
depends on this. Yet in spite of the brisk confidence with which 
Searle proposes his theory, it is actually very hard to see just what it 
is, and harder still to see how it triumphantly solves the problems 
that bedevil others. 

Searle begins by providing sympathetic accounts of the pressures 
that lead both to dualism and to materialism. In motivating dualism, 
he particularly stresses the inner, qualitative, and subjective nature of 
conscious states, or their “first-person ontology.” It is this first-person 
ontology which is disregarded or distorted by “materialist,” or objec- 
tive, scientific, reductions of consciousness, such as behaviorism, 
functionalism, or physicalism. Since dualism and materialism con- 
flict, the solution is to transcend the categories: “In short, the way to 
reply to materialism is to point out that it ignores the real existence 
of consciousness. The way to defeat dualism is simply to refuse to ac- 
cept the system of categories that makes consciousness out as some- 
thing nonbiological, not a part of the natural world” (52). The 
upshot is given in six propositions, which I quote in full: 


[1] Consciousness consists of inner, qualitative, subjective states and 
processes. It has therefore a first-person ontology. [2] Because it has a 
first-person ontology, consciousness cannot be reduced to third-person 
phenomena in the way that is typical of other natural phenomena, such 
as heat, hquidity, or solidity. [3] Consciousness is, above all, a biologi 
cal phenomenon. Conscious processes are biological processes. [4] 
Conscious processes are caused by lowerlevel neuronal processes in the 
brain. [5] Consciousness consists of higherievel processes realized in 
the structure of the brain. [6] There is, as far as we know, no reason in 
principle why we could not build an artificial brain that also causes and 
realizes consciousness. [Format altered by author. ] 


Searle adds a closure clause: “that is it” (53). 

But the matter cannot be left to rest there, since if it does the 
doubt remains that Searle has simply seized some attractive features 
of each of dualism and materialism, without doing anything to show 
how their combination is intelligible. Thus, consider proposition 
[5]. In explaining this, Searle uses a popular kind of example, 
claiming that consciousness is “a higher-level feature of the brain in 
the same way that digestion is a higher-level feature of the stomach” 
(52). Many philosophers of mind would applaud this comparison. 
But they would regard it as a vindication of some kind of physicalism 
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or functionalism. Their mistake, says Searle, is to forget or ignore 
proposition [1]. But then the question is why, if[1] is incompatible 
with materialism, it is nevertheless compatible with [5]. Digestion, 
as Searle admits, is an objective, third-person phenomenon. All 
Searle says here is that it is just a fact that some higher-order states of 
the brain have a special subjective, qualitative, inner ontology. But 
he does nothing to tell us how that can be a fact. 

The first-person ontological status of conscious states is clearly cru- 
cial here. On the one hand, it is beefy and strange enough to rule 
out functionalism and physicalism. On the other, it is not beefy 
enough to provide an insuperable mystery for the idea that con- 
scious states are higher-order, causally efficacious, states of the brain. 
So it is a pity that Searie’s introduction of these notions is quite per- 
functory. Conscious states are “inner” just in the sense that they are 
states of the body and brain; they are “qualitative” in that it feels like 
something to be in such a state; and they are “subjective” in the 
sense that they are always experienced by a human or animal subject 
(42). Now, each of these can be taken as quite trivial consequences 
of the way we ascribe conscious states to persons, just parallel to the 
way in which dents are inner, qualitative, and subjective states of sur- 
faces. It is not at all clear that these aspects of conscious states are 
beefy enough to refute functionalism and physicalism. And it is still 
less clear that they can do this without leaving Searie’s own theory 
quite unintelligible, unless, as is so often the case, there are two dif- 
ferent grades of beef in play. So I doubt if Searle’s solution to the 
mind-body problem is really a solution at all. He may wish to avoid 
labels like ‘dualism’ and ‘materialism’, but this is not the same as 
avoiding the problems for which they are labels. We should be re- 
minded here of the issue between John Locke, who thought that is 
was “God’s good pleasure” to annex definite sensations to physical 
states, and Gottfried Leibniz, who complained that God did nothing 
arbitrarily, but that the mental must be a rationally intelligible ex- 
pression of the physical. Searle writes as if it solved things just to re- 
mind ourselves that the physical does cause the mental. But while 
matters remain there, the crucial issue between Locke and Leibniz 
never swims into focus. 

There are other places where Searle is a little kinder to his own 
ideas than to those of others. One concerns the place that “intrinsic 
intentionality” has in his system. According to Searle, some con- 
scious states possess intrinsic directedness or intentionality. By con- 
trast, other things, and notably sentences, possess intentionality that 
is only derived from the intrinsic intentionality of the conscious 
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states of their speakers (93). Yet Searle also wants to accommodate 
the fact that to think many things seems to require mastery of a lan- 
guage. His solution is that “The child starts with simple prelinguistic 
intentionality. It then learns a simple vocabulary that enables it to 
have a richer intentionality, which in turn enables a richer vocabu- 
lary, which in turn enables a richer intentionality—and so on up ina 
boot-strapping process” (153). Again, how exactly is this supposed 
to work? Consider an example. Suppose, as Jonathan Bennett once 
argued, that nonlinguistic babies or animals cannot think thoughts 
whose content concerns a remote past. Now suppose that mastery of 
the linguistic device of tense rectifies that. How can it do so if the 
linguistic device itself possesses meaning that is entirely derived from 
the “intrinsic intentionality” of persons who use it? We seem to be 
faced not so much with a bootstrap as with a vicious circle: the mean- 
ing of the devices of tense both creates and depends upon the con- 
tent of thought of the people who use it. There may be a way out of 
the circle, but it 1s not at all clear what it would be, and Searle is con- 
tent to leave it there. 

While such problems beset Searle’s system, they should not be al 
lowed to mask his real achievement in bringing some very difficult 
philosophy forcefully and vividly into the popular arena. 

SIMON BLACKBURN 
University of North Carolina/Chapel Hill 
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Consequences of Enlightenment, ANTHONY J. CASCARDI. New York: Cam- 
bridge University Press, 1999. viii + 268 p. Cloth $54.95, paper $19.95. 


Anthony Cascardi’s Consequences of Enlightenment bas three principal 
themes. First, as his title suggests, he is concerned about the legacy 
of the Enlightenment, specifically in connection with the Frankfurt- 
school tradition of critical theory. That tradition is thought to have 
encountered an impasse: its critique of instrumental reason seems so 
thorough that it leaves no room for noninstrumental rationality, yet 
such rationality is needed for the critique of instrumental reason to 
be possible in the first place. The impasse is, of course, not unique 
to critical theory, as it mars any critique of reason that aims to be sys- 
tematic—hence the relevance of Immanuel Kant throughout Cas- 
cardi’s book. His strategy is to analyze ways in which political 
critique can be “reevaluated in light of an aesthetic critique modeled 
along lines sketched out in Kant’s Critique of Judgment” such that this 
impasse can be avoided (1-2). Second, he discloses the Kantian 
roots of a number of modern and postmodern thinkers: Jurgen 
Habermas, Slavoj Zizec, Hannah Arendt, Jean-Francois Lyotard, 
Chantal Mouffe, Roberto Unger, Theodor Adorno, Richard Rorty, 
Roland Barthes (just to give a partial list of the theorists Cascardi 
discusses, only some of whom are Kantians). It turns out that Kant’s 
notion of aesthetic judgment, characterized by indeterminacy, is 
central in their respective conceptions of political critique. Third, as 
Cascardi traces these theorists to the Critique of Judgment while focus- 
ing on their political philosophies, he discloses a deep link in con- 
temporary continental philosophy between aesthetics and politics 
based on a broad conception of aesthetics, that is, one which is not 
confined to autonomous art but which, following a reading of Kant, 
encompasses the subjective realm of affectivity that is tied to a uni 
versal common sense. Aesthetic critique is thus the single issue run- 
ning through all three themes. (In fact, Aesthetics as Critique was 
originally the subtitle of Cascardi’s book, but it was left off, he said, 
in the final stage of publication.) 

Yet what does aesthetics—particularly the indeterminacy of aes- 
thetic judgment—contribute to our understanding of contemporary 
political critique? Addressing this question is a direct way of ascertain- 
ing whether Cascardi provides the conceptual resources to advance 
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the debate about critique he has opened up. While the concept of in- 
determinacy indeed has aesthetic and political consequences, I am 
afraid they are not always the ones Cascardi envisions. 

I 


Cascardi’s aim is to explore the indeterminacy that attaches to aes- 
thetic judgment, starting from the following problem. There is no 
objective principle of taste; rather, there is only a “subjective princi- 
ple—...the indeterminate idea of the supersensible within us” (68). 
Such a subjective, indeterminate principle should not be thought to 
render aesthetic judgment deficient, as it often has been, but should 
rather be seen as the basis of a unique form of judgment capable of 
opening up affective riches which have political consequences for 
the exercise of critique. Cascardi thus begins with an epistemologi- 
cal problem about aesthetic judgment, yet ends up with an argument 
about the political consequences of the “solution” to that problem. 
But how does a critique of aesthetics in the Kantian tradition lead to a 
notion of aesthetics as critique in postmodernity? Cascardi’s path from 
the indeterminacy of aesthetic judgment to politics takes us through 
the “larger project of boundary distinction, which is a concern that 
governs the entire Kantian critical project” (73). He is referring 
here to the separation between cognition (Critique of Pure Reason) 
and morality (Critique of Practwal Reason), and to the fact that the Cri- 
tque of Judgment is supposed to provide the mediating concept, judg- 
ment, between theoretical and practical reason. What relates the 
issue of the relations among Kant’s three Critiques to the issue of the 
indeterminacy of aesthetic judgment (and what thereby establishes 
the broader notion of aesthetics) is that the division between cogni- 
tion and morality (and, even more so, the desire for their reintegra- 
tion) is not something we know but is rather something that is 
“affectively apprehended” by the subject in the form of aesthetic 
judgment (44). To grasp aesthetic judgment in its indeterminacy is 
to grasp this realm of affectivity as a source of critique (as well as of 
possible mediation). 

But while it is clear that indeterminacy is a central notion for Cas- 
cardi, there seems to be some slippage or play among the following 
senses of it which, in turn, are tied to an equally important, though 
undeveloped concept of contingency which also has multiple mean- 
ings. Indeterminacy may be true to its name here, but at the same 
time, it may reveal the contingency of Cascardi’s own critical project. 

(1) The indeterminacy of aesthetic judgment. As we saw, aesthetic 
judgment is indeterminate because its principle is the “indetermrt 
nate idea of the supersensible within us.” In Kant’s language, it is re- 
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flective (indeterminate) rather than determinate because it has no 
determinate concepts, which means it cannot convey any knowledge 
(61-62). It is also subjective, as it is tied to a feeling of pleasure, yet it 
is still thought to be universal. With such an account of aesthetic 
judgment, it is unclear—-or indeterminate (perhaps more for Cas 
cardi than for Kant)—how it works, whether it is indeed universal, or 
what its philosophical role is. 

(2) The indeterminacy of art. Cascardi insists, following Kant, that 
the nature and identity of works of art be left unspecified (55); all we 
can say is that they are created by genius. So it is indeterminate how 
something becomes or is a work of art. 

(3) The indeterminacy of the aesthetic domain. Because aesthetic judg- 
ment and art are indeterminate, the aesthetic domain of philosophy 
is without rules, principles, or criteria (16, 59). So it is indeterminate 
what does or does not belong within this domain, especially when it is 
extended (by Cascardi) beyond the realm of autonomous art. 

(4) The indeterminacy of cognition and morality. The Kantian system 
has a “constitutive opacity” (91) because (as we saw) reason cannot ac- 
count for the division or unity of cognition and morality. Such an ac- 
count lies beyond the limits of reason, yet it is crucial for its legitimate 
exercise (this is the Kantian counterpart of critical theory’s impasse). 
What makes reason indeterminate once its limits become evident is 
that we do not know these limits through the use of determinate com 
cepts; rather, we ascertain them without concepts by feeling what lies 
beyond them. These feelings are reflected in aesthetic judgment, the 
structure of which is as indeterminate as the feelings it mirrors. 

To complicate matters further, these four senses of indeterminacy 
are linked to two different senses of contingency. (A) The contingency 
of the cognition/morahty distinction. Although aesthetic reflective judg- 
ment is supposed to be the middle term linking cognition and 
morality, it necessarily fails in this capacity precisely because it is in- 
determinate (72-77). Such failure shows that the divisions Kant es 
tablishes between cognition and morality are contingent (57), as is 
any unity of reason he might claim. Cascardi argues, however, that 
something very positive follows from this failure and contingency; 
for the appropriate philosophical response here is to recognize that 
there is no “‘solution’ to the division of cognition and morality or a 
proof of the unity of experience” (89). What we have instead is “an 
antinomy: that the principle of the universal validity of [judgments] 
of taste must indeed exist, but that the principle cannot be shown to 
exist” (89). Thus, while judgment fails as a mediator, its failure is its 
strength; for in trying to serve as a mediator, judgment establishes 
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that it is legitimate even though it cannot be proven by reason, be- 
cause we need judgment to complete the critique of reason, without 
which completion we cannot exercise reason legitimately. Thus the 
legitimacy of judgment is intimately tied up with that of reason itself. 

(B) The contingency of the world. The “fact” that the world is not 
structured according to any rational purposes is embodied in human 
affects, specifically pleasure/pain, and is subsequently reflected in 
aesthetic judgments. In short, our affects and judgments are traces 
of the world’s contingency. But what type of contingency are we 
dealing with here? That of history or of nature? Cascardi is not 
clear on this key issue, but it is likely he means both, given what he 
sees (below) as stemming from contingency. In fact, he says little 
else about contingency, which is rather surprising since it clearly 
plays an important role in his argument; for Cascardi claims that 
what Kant seeks in arguing for the universality of subjectively 
grounded aesthetic judgment is “a reappraisal of the role of the pas- 
sions in revealing how human purposes fit in relation to the contin- 
gency of the world” (55). 


Now, to mimic the title of Cascardi’s book, what are the enlighten- 
ment consequences of indeterminacy and contingency? First, inde- 
terminacy serves as the model for Cascardi’s notion of subjectivity. 
“The indeterminacy of the domain that Kant couches in affect thus 
becomes the basis for a conception of the subject as a site of loss or 
dispossession rather than for the recuperation of a positive relation- 
ship to the world” (11). Second, contingency underlies his notion of 
affectivity, for he claims that affects, such as pleasure or pain, are “in 
the realm of the subject, a trace of the contingency of the world” 
(48, 10). Combining these two points, the relationship between in- 
determinacy and contingency seems to be something like the follow 
ing: the contingency of the world is experienced (even suffered) by 
the subject, and such affective experience is mirrored in indeterminate 
aesthetic reflective judgments. Third, Cascardi says that critical 
force attaches to the indeterminacy of the aesthetic (174, 57); that is, 
it is possible “to locate a critical value in a form of judgment that 
takes its bearings by the affects and whose procedures are fundamen- 
tally reflective” (55). Fourth, one, if not the, major consequence of 
Cascardi’s multiple forms of indeterminacy and contingency is that 
they are crucial to political transformation (179), for they open up 
(public) space for politics (and ethics). The link between the aes- 
thetic and the political is provided by the common sense or affectiv- 
ity that grounds the universality of indeterminate judgment, for “the 
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affects provide an opening for an ethical determination of the 
subject-—ethical [and later also political] because determined by a 
responsibility to the other” (11). Affectivity is what is universally 
communicable across the aesthetic, ethical, and political spheres of 
indeterminate judgment. 

The aesthetics-politias link is most concrete in Cascardi’s charac- 
terization of democracy, which brings together his notions of sub- 
jectivity, affectivity, and critique. He argues that contemporary 
thinkers “interested in radicalizing democracy,” himself included, 
“have turned to the project of an aesthetically grounded critique as 
part of an effort to identify what the notions of communication and 
consensus seem to leave out: the ongoing desire of subjects to re- 
fashion the structures they inhabit, coupled with the desire to per- 
petuate democracy by recognizing that Enlightenment democratic 
universalism and its institutions were structurally, inherently incom- 
plete” (179). The resulting notion of democracy is as follows: 
“Democracy describes that form of politics suited to a subject that 
recognizes itself as having no essential nature, that refuses to under- 
stand itself as the effect of some external or underlying cause, and 
that regards change as inevitable insofar as closure is unattainable” 
(180). That is, just as we lack an essential or fixed nature, our polit- 
ical principles and institutions are constitutively incomplete (174, 
194); as a result, change and revision are the norms in politics. Po- 
litical critique is public reflection on what changes and revisions are 
desirable and possible. Modeled on the notion of aesthetic judg- 
ment, such critique should not focus on achieving consensus on de- 
terminate conceptions of liberal principles, such as “freedom” or 
“equality” (180). Rather, political critique should be guided by the 
(Lyotardian) question: “What becomes of the polis when the power 
of judgment is trained not upon some determinate object, but 
rather on what the determination of any object leaves out"—from, 
for example, feminist, racial, or ecological perspectives (181, 174; 
cf. 201-04)? Identification of such exclusion is the new political fo- 
cus instead of the ideal of consensus which, under conditions of 
contingency, can never be attained. Political judgment thus mirrors 
aesthetic judgment in that both are indeterminate and critical. 
Moreover, Cascardi claims they are critical because they are indeter- 
minate. We have thus arrived at the goal of his argument: the inde- 
terminate structure of aesthetic judgment is said to establish a 
subjective, affective basis for political critique aimed at creating a 
democratic analog of the universal common sense that is both pre- 
supposed, and created, by aesthetic judgments. 
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Adorno once remarked that while a work of art appears to be deter- 
minate when it is political, it is in fact indeterminate, though not for 
the reasons Cascardi provides. Rather, the work that appears to be 
determinate is actually indeterminate because it has been determined 
by so many historical factors that, with respect to their causality, are 
hard to pin down. Carrying Adorno’s dialectic one step further, the 
work appears to be indeterminate because of these many factors, yet 
they are what make it determinate. So the (political) work of art that 
first appears to be determinate turns out to be indeterminate, yet it 
appears to be indeterminate only because it is determinate. Analyz- 
ing these appearances instead of defending either determinacy or in- 
determinacy alone is the difficult task shared by aesthetic and 
political critique. In this light, if art is to be understood in terms of 
the notion of indeterminacy at all, then I would suggest that it be un- 
derstood in terms of the notion of indeterminate determtnacy. By this 
notion, I mean a determinacy which is shaped by issues definitive of 
the history of the present but which is open to multiple interpreta- 
tions so that it escapes the stranglehold of reigning ideologies and is 
thus able to engage them; critique lies in the determinate space be- 
tween this escape and engagement. Such indeterminate determinacy 
is to be distinguished from the notion of determinate indeterminacy, 
which is a commitment in art (often later mirrored in aesthetics) to 
be indeterminately related to, and thus distant from, history, as if 
art’s ability to be critical would lie in such indeterminacy or distance 
alone. This strikes me as the kind of indeterminacy Cascardi em- 
braces in his accounts of aesthetic as well as political judgment. But I 
think such indeterminacy is a symptom of art’s historical irrelevance 
couched in the reputedly redeeming concept of indeterminacy, as if 
an appeal to that concept could transform irrelevance (a fact) into a 
virtue (a norm). Art does not need to be redeemed in its irrelevance. 
Rather, like aesthetic and political judgment, it has to find a path to 
relevance that grounds critique. With this point in mind, I would like 
to raise several questions about Cascardi’s treatment of indetermi- 
nacy in connection with aesthetics and politics. 

(1) The ground of Cascardi’s aesthetics-politics link is the duo of 
(epistemological) indeterminacy and (ontological) contingency. 
But how could they possibly anchor political critique? Critique 
seems incapable of supporting any concrete political discourse, so 
long as it is grounded in nothing other than indeterminacy, even 
when indeterminacy is rendered more concrete by being tied to the 
notion of exclusion. But my criticism of Cascardi’s position is not 
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that political critique is not grounded philosophically; for I agree 
that it is groundless in the sense that it is contingent (240) and that, 
at the same time, it is grounded as much as it needs to be. My crit 
cism is rather that what he provides through his account of contem- 
porary political critique is a description (albeit an accurate one) of 
its predicament, namely, that it is indeterminate. The reason it is in- 
determinate, however, is not that it is modeled on aesthetic judg- 
ment, but because it lacks a determinate foothold in politics. The 
problem here, therefore, is not a philosophical one; if it were, the 
concept of indeterminacy may very well be a solution, since indeter- 
minate judgment is (self) grounded. The problem of political crit 
tique Cascardi describes is rather that it lacks a political grounding, a 
problem to which no philosophical concept alone could ever be the 
solution. The link here with contemporary aesthetics is that, with 
few exceptions, it is in a similar predicament: it lacks a determinate 
foothold in art. In this light, indeterminacy is the name of a prob- 
lem shared by aesthetic and political judgment rather than a re- 
source for solving it. To make this point using the earlier notion of 
exclusion, what is excluded here is either art or politics. 

(2) Cascardi inscribes into the ontology of art a certain impossibil 
ity which is expressed by Jacques Lacan, albeit in Cascardi’s words, as 
follows: “the category of ‘art’ comes into being within the culture of 
enlightened modernity not as the effect of any determinate set of 
causes but only as the consequences of an impossibility-—specifically, 
as a consequence of constituting the ‘whole’ of experience by bridg- 
ing the ‘two worlds’ that Kant had set aside in his first two critiques” 
(98). The impossibility of ever unifying experience, which is a fail 
ure of reason, is thus the origin of art (and of judgments thereof, 
since such failure has the effect, as we have seen, of validating aes- 
thetic judgment). Yet what is the rationale for inscribing such im- 
possibility, which, after all, is an epistemological matter, into the 
ontological heart of art? Why should art be ontologically deter- 
mined to be indeterminate because of an epistemological limitation 
of reason? Are the determinate causes of art really so obscure, how- 
ever complex they may be? Instead of addressing such questions, 
Cascardi has written his account of aesthetic judgment into his ac- 
count of art. The result is a conception of art defined by aesthetics 
conceived as “independent of the specificity of works of fine art” (2- 
3). Why should art be defined by an aesthetics deliberately sepa- 
rated from art? If Cascardi is going to appeal to aesthetics to 
understand. politica, I think he might first rethink art and then re- 
connect aesthetics to art. 
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(3) Of course, Cascardi makes it clear that he is disinterested in 
art because aesthetic judgment has relevance and application well 
beyond art. That may be true, but he might first reconsider his dis- 
interest because it has negative consequences for his own strategy. 
He explicitly works not only with a notion of aesthetics which is not 
confined to autonomous art but which is also separated from it. 
This means that aesthetic judgment (and critique) is likewise sepa- 
rated from art. Then what is aesthetic judgment a judgment of? Af- 
fectivity? Besides being rather indeterminate, that answer would not 
work because (the universal communicability of) affectivity is the 
ground of aesthetic judgment and so cannot be its object. Then 
what is? Contingency? It is not clear. This unclarity is a problem 
for Cascardi because he models political judgment on aesthetic judg- 
ment and is thus faced with a parallel problem: political judgment is 
analogously (if inadvertently) conceived as “independent of the 
specificity” of politics and is thus as separated from politics as aes- 
thetic judgment is from art. The consequence of such separation is 
irrelevance, and indeterminacy is its symptom. So indeterminacy is 
not, as Cascardi claims, what opens up the possibility of (aesthetic or 
political) critique. I am not suggesting that it (or he) has necessarily 
closed off this possibility but that by elevating indeterminacy to both 
a philosophical principle and a political norm, he has mistaken a 
symptom of critique’s irrelevance for a condition of its possibility. In 
a word, he has misidentified a symptom (indeterminacy) of the fact 
that we lack something (critique) as the norm to which we should as- 
pire, hoping it (indeterminacy) will eliminate the lack. That would 
be like identifying the hunger we feel when we lack food as the norm 
to which we should aspire, hoping it will alleviate our hunger despite 
the continued absence of food. 

MICHAEL KELLY 
Columbia University 


638 THE JOURNAL OF PHILOSOPHY 


In the Light of Logic. SOLOMON FEFERMAN. New York: Oxford Univer- 
sity Press, 1998. xii + 340 p. Cloth $49.95. 


In December of 1998, a conference was held at Stanford University 
in honor of Solomon Feferman’s seventieth birthday. Mathemati- 
clans, philosophers, computer scientists, historians, and even a lin- 
guist or two gathered from about a dozen countries to celebrate 
Feferman’s work in logic. Surveying this work, one detects a reassur- 
ing uniformity of method, and a general template for research in the 
field: first, find an issue relevant to the philosophy or history of 
mathematics that calls for clarification; next, design formal represen- 
tations of the relevant mathematical phenomena, and apply the 
methods of logic and proof theory to their analysis; finally, interpret 
the results of the inquiry with respect to the original issues. Fefer- 
man’s contributions to mathematical logic are substantial, but his 
work is perhaps best appreciated in this broader context. It is pre- 
cisely this interplay between mathematical logic and the philosophy 
of mathematics that constitutes the central theme of In the Light of 
Logi, a collection of reflective essays spanning the last two decades. 
These writings do not offer a grand unified theory of mathematics, 
but, rather, focus on more sharply delimited issues. For example, a 
number of essays, including “Infinity in Mathematics,” “Weyl Vindi- 
cated,” and “Why a Little Bit Goes a Long Way,” address the notion 
of predicaizvity. In an informal sense, the word is used to describe ap- 
proaches to mathematics in which concepts are introduced “from 
the bottom up,” in accordance with the Poincaré-Russell Vicious Cir- 
cle Principle. From such a standpoint, one may be willing to accept 
quantification over the natural numbers as unproblematic; but, tak- 
ing sets of natural numbers to be given by explicitly defined proper- 
ties, one may reject definitions that treat the collection, of all such 
sets as a completed totality. This informal stance forms the basis of 
Hermann Weyl’s landmark 1918 work, Das Kontinuum,' in which a 
significant portion of real analysis is developed on such a founda- 
tion. Since then, a good deal of effort has gone toward providing a 
formal analysis of predicativity, but no one has done more than Fe- 
ferman to shape our current understanding of the term. In the 
1960s, he identified a constellation of formal theories that seem to 
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exhaust the predicativist’s means, and using the methods of Gerhard 
Gentzen, Kurt Schutte, and the “Munich School” of proof theory, he 
determined the ordinal, I), that represents the least upper bound to 
their prooftheoretic strength. In more recent years, a number of 
theories have been shown to be prooftheoretically reducible to Fe- 
ferman’s, and most interested parties are willing to use this criterion 
to obtain a working definition of predicativity. In other words, the 
general feeling is that Feferman has got it right, both in providing a 
formal analysis of a certain foundational stance, and in identifying 
an interesting and significant segment of mathematical reasoning. 

Ongoing work in proof theory has shown that a good deal of 
mathematics can be carried out in predicative theories. In fact, 
many mathematical arguments can be formalized in theories that 
are weaker, and sometimes one can even get away with theories con- 
servative over primitive recursive arithmetic, which many take to rep- 
resent David Hilbert’s notion of finitistic reasoning. One route to 
obtaining results like these—dating back to Hilbert and Paul 
Bernays and vigorously pursued by Harvey Friedman, Stephen Simp- 
son, and others—is to use restricted subsystems of second-order 
arithmetic to carry out the formalization. A drawback of this ap- 
proach is that in these theories, mathematical objects (functions, 
fields, metric spaces, measures, and so on) have to be coded as sets 
of numbers, which explicitly restricts the discourse to mathematical 
objects that are countably presented. Feferman has developed a 
more flexible approach with his theories of explicit mathematics, 
where the countable interpretations lie, in a sense, behind the 
scenes. This difference aside, however, the two approaches paint 
pictures that are essentially in agreement as to “where” everyday 
mathematical reasoning takes place. 

Some of the conclusions that Feferman wishes to draw from this 
analysis are controversial. While Friedman and others have discov- 
ered examples of combinatorial theorems (like Kruskal’s theorem, 
and finitizations thereof) whose proofs are necessarily impredica- 
tive, Feferman has conjectured, on the basis of the evidence just de- 
scribed, that at least those branches of mathematics which have 
direct scientific applications can be developed on a predicative ba- 
sis. He takes this to weigh against arguments offered by W. V. 
Quine and Hilary Putnam in favor of a realist, set-theoretic ontol- 
ogy, which maintain that such a mathematical ontology is indispens- 
able to the sciences. Feferman laments the fact that arguments like 
these have, to a large extent, supplanted alternative philosophies 
for the foundations of mathematics, like constructivism or predica 
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trvism. In an essay provocatively titled “Infinity in Mathematics: Is 
Cantor Necessary?” he quotes Weyl in characterizing the Platonism 
which underlies Cantorian set theory as the “medieval metaphysics 
of mathematics,” and pronounces it “utterly unsatisfactory as a phi- 
losophy of our subject” (248). 

Feferman does not offer much detail as to the form of Platonism 
he takes to underlie set theory, however, nor does he discuss a num- 
ber of philosophical views, realist or otherwise, that can reasonably 
be associated with the subject. His efforts are directed, more specifi- 
cally, toward a critique of the indispensability arguments mentioned 
above. Suggesting that the Cantorian world view is dispensable is a 
sure-fire way of raising set theorists’ hackles, and so it is important to 
keep clear what is at stake. Feferman is not primarily concerned 
with offering a normative account of mathematical practice, but, 
rather, determining the appropriate logical and philosophical analy- 
sis thereof. A set theorist could respond to the proof-theoretic ac- 
count in a number of ways. First, she could argue that this account 
has something of a revisionist tinge: mathematicians use “large” sets 
and impredicative definitions freely, and though (after the fact, and 
in principle) we can often eliminate these features from a given the- 
ory, the impredicative conception may have been essential to the 
theory’s development, and may still be essential to a full understand- 
ing of its terms. As far as the outer reaches of set theory are con- 
cerned, she could defend the subject as a search for principles that 
might, in the future, play a central role in everyday mathematics. Af 
ter all, large cardinal axioms have consequences for the continuum, 
and Friedman has long been searching for interesting finitary conse- 
quences as well. If our scientific theories do not currently warrant 
granting large cardinals full ontological status, one day they might, 
and a liberal and open-minded view of mathematics should prevent 
us from adopting a philosophical stance that precludes this possibil- 
ity. Along these lines, our set theorist might even question the ex- 
tent to which a minimal mathematical ontology is desirable. If, as 
Georg Cantor and Hilbert maintained, the essence of mathematics 
lies in its freedom of abstraction, then the proof theorist’s drive to 
tether the subject to a restricted ontology misses the point; the re- 
sulting account of mathematics may soothe the apprehensive 
philosopher, but it may have little to do with mathematics itself. 

Feferman would, no doubt, have much to say in reply. But I think 
it is important to emphasize that there is more to the prooftheoretic 
program than the negative task of reining in profligate ontologies. 
A pair of essays titled “Foundational Ways” and “Working Founda- 
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tions” offer a thoughtful and spirited discussion of research in the 
foundations of mathematics, surveying a range of methodological 
approaches, emphasizing the positive gains of such research, and il- 
lustrating these gains with well-chosen examples. The benefits dis- 
cussed include the clarification of fundamental concepts, the 
resolution of conceptual problems and paradoxes, the systematic or- 
ganization of mathematical ideas, and the avenues toward general 
ization made possible by such a presentation. From this point of 
view, the proof-theoretic formalizations and reductions discussed 
above are to be valued for the additional information they provide 
with regard to the structure of the mathematical theories under con- 
sideration. For example, the development of real analysis in a re- 
stricted framework helps us determine which features of the 
mathematical universe are essential to the subject, and how they are 
used; it shows us that the theorems of analysis hold true not only in 
the set theorist’s well-stocked universe, but in restricted models as 
well; and it helps us locate the “concrete” content of the theory’s ab- 
stract concepts. As to the broader philosophical implications, Fefer- 
man writes in “What Rests on What?": 


It seems to me that the information provided by the kind of case studies 
described here involving the formalization of considerable tracts of 
everyday mathematics in appropriate systems, in combination with the 
results of their prooftheoretic reductions...must be taken into account 
in the continuing effort to develop a relevant and sustainable philoso- 
phy of mathematics (207). 


But with characteristic caution, he adds: 


Though I personally believe that the kinds of results described here on 
the whole strengthen the case for a nonplatonistic philosophy of mathe- 
matics and further undermine the case for set-theoretical realism, they 
do not speak for themselves to that extent, and it is at that point that a 
wellinformed philosophical discussion must take over (ibid. }. 


In other words, the results of the prooftheoretic inquiry seem to tell 
us something about mathematics, and should be taken into account 
but the philosophical mileage one obtains from these results may vary. 

In the Light of Logic is divided into five parts. The first part focuses 
on historical topics and surveys some of the major lines of thought 
that have shaped the development of mathematical logic in this cen- 
tury. The second part provides a general discussion of the founda- 
tional enterprise and includes a balanced critique of Imre Lakatos’s 
views on mathematics; with regard to the latter, Feferman finds 
“much to agree with in both the general approach and detailed 
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analysis,” but ultimately concludes that the theory taken in isolation 
fails to account for the distinctive character of mathematics (78). 
The third part examines the work of Kurt Godel and weaves together 
aspects of his life, mathematics, and philosophy in a moving way. 
The fourth part outlines the proof-theoretic view of mathematics 
embodied in the kinds of results discussed above; and the fifth ex- 
plores the consequences of these results for the philosophy of math- 
ematics, largely vis-a-vis indispensability arguments. With prose that 
is crisp and precise yet accessible to a general audience, this book is 
essential reading for anyone interested in these topics,. 

For the most part, the research discussed in these essays is founda- 
tional in the traditional sense: the logician’s task is seen to be that of 
devising formal accounts that characterize the basic stuff of which 
the mathematical universe is composed, and the philosopher's task 
is seen to be that of providing epistemological and ontological justifi- 
cations of mathematics, as portrayed by these accounts. Today, 
there is a growing sense that these are not the only issues in the phi- 
losophy of mathematics which are worthy of sustained inquiry. For 
example, one may seek accounts of mathematics which illuminate 
the structural view of the subject that has been on the rise since the 
Cantor-Dedekind revolution of the last century, theories of meaning 
and rationality that explain how mathematical theories grow and 
evolve over time; or theories of discovery and explanation in mathe- 
matics, akin to those which have been developed for the sciences. 
But the tone of Feferman’s writing suggests that he would not be 
averse to investigations of this sort, and traditionalists among us like 
to think that the general methodological approach exhibited in 
these essays has much to offer in that regard. In the end, the extent 
to which mathematical approaches can inform philosophical inquiry 
is subject to debate; but the rigor and clarity of Feferman’s essays 
make a strong case on their behalf. 


JEREMY AVIGAD 
Carnegie Mellon University 
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